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Description 

IDENTIFICATION OF NOVEL MS4A GENE FAMILY MEMBERS 



EXPRESSED BY HEMATOPOIETIC CELLS 



5 



Cross Reference to Related Applications 



This application is based on and claims priority to United States 
Provisional Application Serial Number 60/254,362, filed December 8, 2000, 
and United States Provisional Application Serial No. 60/270,057 filed 
February 20, 2001 , herein incorporated by reference in their entirety. 



This work was supported by NIH grants CA-81776 and CA-54464. 
Thus, the U.S. Government has rights in the invention. 



The present invention generally relates to a new class of MS4A 
proteins characterized by a membrane-embedded structure. More 
particularly, the present invention provides MS4A nucleic acid and 
polypeptide sequences, chimeric genes comprising disclosed MS4A 
20 sequences, antibodies that specifically recognize MS4A polypeptides, and 
uses thereof. 



10 



Grant Statement 



15 



Field of the Invention 



Table of Abbreviations 



ATCC 



American Tissue Culture Collection 



25 



30 



GFP 



CD20 



FceRip 



htgs 



HTm4 



MS4A family 



CD20 B lymphocyte differentiation antigen 
high-affinity IgE receptor p chain 
green fluorescent protein 
GenBank human genomic database 
hematopoietic CD20-like antigen 
membrane spanning 4-domain family, 
subfamily A 
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Background Art 

CD20, FceRIp, and HTm4 are three cell surface proteins expressed 
by hematopoietic cells that represent members of a nascent gene family 
• (Adra et al. (1999) Clin Genet 55:431-437, Kinet (1999) Anna Rev Immunol 
5 17:931-972; Tedder and Engel (1994) Immunol Today 15:450-454). The 
deduced amino acid sequence of human and mouse CD20 first 
demonstrated a cell surface protein containing four membrane-spanning 
regions, N- and C-terminal cytoplasmic domains, and an ~50 amino acid 
loop that serves as the extracellular domain (Einfeld et al. (1988) EMBO J 

10 7:711-717; Stamenkovic and Seed (1988) J Exp Med 167:1975-1980; 
Tedder et al. (1988a) J Immunol 141:4388-4394; Tedder et al. (1988b) Proc 
Natl Acad Sci USA 85:208-212). Human CD20 shares 20% amino acid 
sequence identity with FceRIp and HTm4 (Adra et al. (1994) Proc Natl Acad 
Sci USA 91:10178-10182, Kuster et al. (1992) J Biol Chem 267:12782- 

15 12787). Moreover, these three proteins have a similar overall structure in 
man, mouse, and rat with significant sequence identity within the first three 
membrane-spanning domains (Kinet et al. (1988) Proc Natl Acad Sci USA 
85:6483-6487; Ra et al. (1989) Nature 19:1771-7; Tedder et al., 1988a). In 
addition, all three genes are located in the same region of human 

20 chromosome 11q12-13.1 (Adra et al., 1994; Hupp et al. (1989) J Immunol 
143:3787-3791; Tedder et al. (1989a) J Immunol 142:2555-2559) and 
mouse chromosome 19 (Hupp et al. 1989; Tedder et al., 1988a). These 
three genes are therefore likely to have evolved from a common precursor. 

Despite structural and sequence conservation between CD20, FceRIp 

25 and HTm4, transcription of each gene is differentially regulated. CD20 is 
only expressed by B lymphocytes (Stashenko et al. (1980) J Immunol 
125:1678-1685; Tedder et al., 1988a). FceRIp is expressed by mast cells 
and basophils (Kinet, 1999). HTm4 is expressed by diverse lymphoid and 
myeloid origin hematopoietic cells (Adra et al., 1994). 

30 Although the function of HTm4 remains unexplored, CD20 and FceRIp 

have critical roles in cell signaling. CD20 forms a homo- or hetero-tetrameric 
complex that is functionally important for regulating cell cycle progression 
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and signal transduction in B lymphocytes (Tedder and Engel, 1994). CD20 
additionally regulates transmembrane Ca ++ conductance, possibly as a 
functional component of a Ca ++ -permeable cation channel (Bubien et al. J 
Cell Biol 121:1121-1132; Kanzaki et al. (1997a) J Biol Chem 272:14733- 
5 14739; Kanzaki et al. (1997b) J Biol Chem 272:4964-4969; Kanzaki et al. 

(1995) J Biol Chem 270:13099-13104). FcsRIp is part of a tetrameric 
receptor complex consisting of a, p, and two y chains (Blank et al. (1989) 
Nature 337:187-189). FcsRIp mediates interactions with IgE-bound antigens 
that lead to cellular responses such as the degranulation of mast cells. 

10 Specifically, the FcsRIp subunit functions as an amplifier of FcsRip-mediated 
activation signals (Dombrowicz et al. (1998) Immunity 8:517-529; Lin et al. 

(1996) Cell 85:985-995). Because of their unique structure and sequence 
homology, CD20, FcsRIp, and HTm4 are likely to share overlapping 
functional properties. 

15 CD20 and FcsRIp are also important clinically. Antibodies against 

CD20 are effective in treating non-Hodgkin's lymphoma (McLaughlin et al. 
(1998) Oncology 12:1763-1769; Onrust et al. (1989) J Biol Chem 
264:15323-15327; Weiner (1999) Semin Oncol 26:43-51). Genetic 
variations at chromosome 1 1q12-13 can also play a role in the pathogenesis 

20 of allergic diseases (Adra et al., 1999; Kinet, 1999). Recent studies suggest 
that FcsRIp contributes to such diseases, and other genetic elements in this 
region likely also contribute to allergic disease. 

Since CD20, FcsRIp, and HTm4 are likely to have evolved by 
duplication of an ancestral gene, other related proteins might exist that form 

25 additional receptor complexes. In view of the clinical importance noted 
above, the identification of such proteins thus represents a long-felt and 
ongoing need in the art. To address this need, applicants have identified 
novel human and mouse proteins that span the cell membrane at least four 
times and share high levels of amino acid sequence identity with CD20, 

30 FcsRIp, and HTm4. This finding reveals a new gene family that has been 
designated herein as the MS4A family (membrane spanning 4-domain 
family, subfamily A). Currently this family contains at least 10 subgroups 
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(MS4A1 through MS4A12) that encode at least 21 previously unidentified 
human and mouse proteins expressed by hematopoietic cells and by diverse 
cell types in non-hematopoietic tissues. 

5 Summary of the Invention 

The present invention discloses isolated MS4A polypeptides and 
isolated nucleic acid molecules encoding the same. Preferably, an isolated 
MS4A polypeptide, or functional portion thereof, comprises a polypeptide 
encoded by the nucleic acid molecule of any one of the odd numbered SEQ 

10 ID NOs:1-37 a polypeptide encoded by a nucleic acid molecule that is 
substantially identical to any one of the odd-numbered SEQ ID NOs:1-37, a 
polypeptide fragment encoded by a 20 nucleotide sequence that is identical 
to a contiguous 20 nucleotide sequence of any one of the odd-numbered 
. SEQ ID NOs:1-37, a polypeptide having an amino acid sequence of any one 

15 of the even-numbered SEQ ID NOs:2-38, a polypeptide that is a biological 
equivalent of any one of the even-numbered SEQ ID NOs:2-38, or a 
polypeptide that is immunologically cross-reactive with an antibody that 
shows specific binding with a polypeptide comprising some or all amino 
acids of any one of the even-numbered SEQ ID NOs:2-38. 

20 The present invention further teaches chimeric genes having a 

heterologous promoter that drives expression of a nucleic acid sequence 
encoding a MS4A polypeptide. Preferably, the chimeric gene is carried in a 
vector and introduced into a host cell so that a MS4A polypeptide of the 
present invention is produced. Preferred host cells include but are not 

25 limited to a bacterial cell, a hamster cell, a mouse cell, or a human cell. 

In another aspect of the invention, a method is provided for detecting 
a nucleic acid molecule that encodes a MS4A polypeptide. According to the 
method, a biological sample having nucleic acid material is hybridized under 
stringent hybridization conditions to a MS4A nucleic acid molecule of the 

30 present invention. Such hybridization enables a nucleic acid molecule of the 
biological sample and the MS4A nucleic acid molecule to form a detectable 
duplex structure. Preferably, the MS4A nucleic acid molecule includes some 
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or all nucleotides of any one of the odd-numbered SEQ ID NOs:1-37. Also 
preferably, the biological sample comprises human nucleic acid material. 

The present invention further teaches an antibody that specifically 
recognizes a MS4A polypeptide. Preferably, the antibody recognizes some 
5 or all amino acids of any one of the even-numbered SEQ ID NOs:2-38. A 
method for producing a MS4A antibody is also disclosed, and the method 
comprises recombinantly or synthetically producing a MS4A polypeptide, or 
portion thereof; formulating the MS4A polypeptide so that it is an effective 
immunogen; immunizing an animal with the formulated polypeptide to 

10 generate an immune response that includes production of MS4A antibodies; 
and collecting blood serum from the immunized animal containing antibodies 
that specifically recognize a MS4A polypeptide. Antibody-producing cells 
can be optionally fused with an immortal cell line whereby a monoclonal 
antibody that specifically recognizes a MS4A polypeptide can be selected. 

15 Preferably, the MS4A polypeptide used as an immunogen includes some or 
all amino acid sequences of any one the even-numbered SEQ ID NOs:2-38. 

A method is also provided for detecting a level of MS4A polypeptide 
using an antibody that specifically recognizes a MS4A polypeptide. 
According to the method, a biological sample is obtained from an 

20 experimental subject and a control subject, and a MS4A polypeptide is 
detected in the sample by immunochemical reaction with the MS4A 
antibody. Preferably, the antibody recognizes amino acids of any one of the 
even-numbered SEQ ID NOs:2-38, and is prepared according to a method 
of the present invention for producing such an antibody. 

25 The present invention further discloses a method for identifying a 

compound that modulates MS4A function. The method comprises: exposing 
an isolated MS4A polypeptide to one or more compounds, and assaying 
binding of a compound to the isolated MS4A polypeptide. A compound is 
selected that demonstrates specific binding to the isolated MS4A 

30 polypeptide. Preferably, the MS4A polypeptide used in the binding assay of 
the method includes some or all amino acids of any one of the even- 
numbered SEQ ID NOs:2-38. 
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Also provided is a method for identifying a regulator of MS4A gene 
expression. The method comprises (a) exposing a cell sample with a 
candidate compound to be tested, the cell sample containing at least one 
cell containing a DNA construct comprising a modulatable transcriptional 
5 regulatory sequence of a MS4A-encoding nucleic acid and a reporter gene 
which is capable of producing a detectable signal; (b) evaluating an amount 
of signal produced in relation to a control sample; and (c) identifying a 
candidate compound as a modulator of MS4A gene expression based on the 
amount of signal produced in relation to a control sample. Preferably, the 

10 modulatable transcriptional regulatory sequence of a MS4A-encoding 
nucleic acid comprises a sequence that is immediately upstream of the initial 
coding region of a MS4A gene as set forth in any one of SEQ ID NOs:73-81 . 

The present invention further provides a method for modulating MS4A 
function in a subject. According to the method, a pharmaceutical 

15 composition is prepared that includes a substance capable of modulating 
MS4A expression or function, and a carrier. An effective dose of the 
pharmaceutical composition is administered to a subject, whereby MS4A 
activity is altered in the subject. Provided are therapeutic methods wherein 
a change in MS4A activity comprises a shift in the abundance of cell 

20 subpopulations expressing said protein, modulation of [Ca 2+ ]j levels, or 
altered cell function. In a preferred embodiment, the substance used to 
perform this method shows specific binding to some or all amino acids of 
any one of the even-numbered SEQ ID NOs:2-38, and was discovered by a 
method of the present invention. In another embodiment, MS4A function is 

25 disrupted by immunizing a subject with an effective dose of the disclosed 
MS4A polypeptide. The immune system of the subject produces an 
antibody that specifically recognizes the MS4A polypeptide, and preferably 
recognizes some or all of amino acids of any one of the even-numbered 
SEQ ID NOs:2-38. In a further embodiment, a gene therapy vector is used, 

30 the vector comprising a nucleotide sequence encoding a MS4A polypeptide. 
Alternatively, the gene therapy vector comprises a nucleotide sequence 
encoding a nucleic acid molecule, a peptide, or a protein that interacts with a 
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MS4A nucleic acid or polypeptide. Preferably, the subject is a human 
subject. 

Accordingly, it is an object of the present invention to provide novel 
MS4A nucleic acid and polypeptide sequences, and novel methods relating 
5 thereto. This object is achieved in whole or in part by the present invention. 

An object of the invention having been stated above, other objects 
and advantages of the present invention will become apparent to those 
skilled in the art after a study of the following description of the invention, 
Figures and non-limiting Examples. 

10 

Brief Description of the Drawings 
Figure 1 depicts cDNAs encoded by fifteen new human or mouse 
MS4A gene products. Consensus sequences from cDNAs and overlapping 
ESTs are indicated by their GenBank Accession numbers. Representative 

1 5 full-length cDNAs for each gene product are shown, except for MS4a3 which 
was not full-length. 5' and 3' untranslated sequences are shown as 
horizontal lines with relative nucleotide lengths shown. Coding regions are 
shown as boxes with translation initiation and termination codons and their 
relative nucleotide locations shown. Poly(A) attachment signal sequences 

20 (AATAAA) are indicated when known. Deduced hydrophobic regions are 
shown as filled boxes with the predicted membrane-spanning domains 
shown as TM1-TM4. Additional hydrophobic regions in MS4A4 proteins are 
shown as shaded boxes. Sites of putative nucleotide polymorphisms in 
MS4A6A are indicated by two (X)s. 

25 Figure 2 depicts exon-intron organization of the human MS4A genes. 

The maps were constructed by aligning known and predicted MS4A cDNA 
sequences with human genomic sequences as described in Materials and 
Methods. Exons are shown as boxes with the predicted translation initiation 
codons (ATG), transmembrane domains (TM) and termination codons 

30 indicated on the top. All exon and intron distances are shown to scale. 
Gaps indicate where intron distances have not been determined for MS4A3, 
MS4A4A, and MS4A12. Two long introns present in MS4A4E are not to 
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scale but the intron lengths are indicated. Exon numbering for MS4A1, and 
MS4A2 is as published (Kuster et al., 1992; Tedder et al., 1988a; Tedder et 
al., 1988b). 

Figure 3 shows human MS4A4E protein and transcript sequences 
5 predicted from genomic DNA sequences. MS4A4E sequences are 
compared with human MS4A4A cDNA (disclosed herein) and genomic 
sequences. Gaps were introduced to provide optimal alignment. The boxed 
AAC sequence near the 5' end of the MS4A4A sequence indicates the 
length of the most 5' MS4A4A cDNA sequence. Sequences upstream of this 

10 are based on contiguous genomic DNA sequences. Nucleotide numbering 
is based on the MS4A4A cDNA sequence, disclosed herein. Predicted 
translation initiation codons are shaded. Predicted membrane-spanning 
regions are underlined. An asterisk indicates predicted translation 
termination codons. Potential poly-A attachment signal sequences 

1 5 (AATAAA) are boxed. 

Figure 4 shows human MS4A6E protein and transcript sequences 
predicted from genomic DNA and overlapping cDNA sequences. 
PredictedMS4>46E transcript sequences are compared with human MS4A6A 
cDNA sequence (disclosed herein). Gaps were introduced in the nucleotide 

20 sequence to provide optimal alignment. The 5' end of both transcripts start 
at 3' splice-acceptor sites which demark the first translated exons for both 
genes. The 5' end of the putative MS4A6E transcript is based on genomic 
DNA sequence, while the predicted sequences starting at nucleotide 60 
were based on both genomic DNA sequences and overlapping cDNA 

25 sequences. A gap in the MS4A6A sequence is indicated where TM1/2 and 
TM2 exons are not found in MS4A6E transcripts. MS4A6A nucleotide 
numbering is based on the cDNA sequence (disclosed herein). Predicted 
translation initiation codons are shaded. Predicted membrane-spanning 
regions are underlined. An asterisk indicates the predicted translation 

30 termination codon for the MS4A6E protein. 

Figure 5 shows human MS4A10 protein and transcript sequences 
predicted from human genomic DNA sequences. MS4A10 nucleotide 
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sequence is compared with mouse MS4a10 cDNA sequence (disclosed 
herein). The 5' end of both transcripts start at 3' splice-acceptor sites which 
demark the first translated exons for both genes. MS4a10 nucleotide 
numbering is based on the cDNA sequence (disclosed herein). Predicted 
5 . translation initiation codons are shaded. Predicted membrane-spanning 
regions are underlined. An asterisk indicates predicted translation 
termination codon for the MS4A10 protein. Potential poly-A attachment 
signal sequences (AATAAA) are boxed. 

Figure 6 depicts a physical linkage map for the MS4A genes. A 

10 scheme for chromosome 11 structure is shown on the left with the mapped 
locations for MS4A1, MS4A2 and MS4A3 indicated. Representative human 
BAC clones are shown as vertical black bars with clone names shown on the 
top and clone size shown at the bottom. All distances are shown to the 
indicated scale. The distance between and spatial relationship of RP11- 

15 312N17 to the four other overlapping BACs shown at the bottom are 
unknown. Thin bars indicate continuous characterized (mapped or 
sequenced) regions of DNA that contain identified MS4A genes. When the 
relative position of this region of DNA is known relative to the representative 
BACs that are shown, the thin bars overlay the BAC. The mapped position 

20 of each MS4A gene is indicated on the right with the relative direction of 
gene translation indicated by arrows ( — ►). In some cases, approximate 
distances between MS4A genes (termination codons to the translation 
initiation codon for the next gene) are indicated in base pairs (bp). In some 
cases, approximate MS4A gene size is indicated showing the distance 

25 between predicted translation initiation codons and translation termination 
codons as show in Figure 7. 

Figure 7 depicts deduced amino acid sequences for CD20 (human 
A1, SEQ ID NO:40; mouse a1, SEQ ID NO:48), FceRip (human A2, SEQ ID 
NO:42; mouse a2, SEQ ID NO:50), HTm4 (human A3, SEQ ID NO:44; 

30 mouse a3, SEQ ID NO:20), and 19 new MS4A (human) (even-numbered 
SEQ ID NOs:2-18, 46) and MS4a (mouse and pig) proteins (even-numbered 
SEQ ID NOs:22-38, 56). Gaps were introduced to optimize alignments. 
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Numbers represent predicted residue positions. The predicted membrane- 
spanning regions (TM1-TM4) are indicated. Predicted intron|exon splice 
junctions are indicated by vertical bars where information was available. 
Amino acids common to 10 or more proteins are shaded, indicates partial 
5 sequence for the MS4a3 protein. CD20, FceRip, and HTm4 sequences and 
known intron|exon borders (SEQ ID NOs:39-44, 47-50) are as published 
(Adra et al., 1994; Kuster et al. B 1992; Ra et al., 1989; Tedder et al., 1988a; 
Tedder et al., 1989b; Tedder et al., 1988b). MS4A12 represents a 
conceptual translation (SEQ ID NO:46) of a human colon mucosa cDNA 

10 sequence (GenBank AK000224, SEQ ID NO:45), and MS4a12 represents a 
conceptual translation (SEQ ID NO:56) of a homologous cDNA sequence 
from pig (GenBank AJ236932, SEQ ID NO:55). 

Figure 8 depicts UPGMA (unweighted pair group method using 
arithmetic averages) tree of deduced MS4A and MS4a protein sequences. 

15 Horizontal tree branch length is a measure of sequence relatedness. For 
example, MS4a4B and MS4a4C are the most similar in sequence, while 
CD20 (MS4A1) sequences were the most divergent from other family 
members. The MS4a12p sequence was from pig, while all other MS4a 
sequences were from mouse. The UPGMA tree was generated using 

20 Geneworks version 2.0 (IntelliGenetics, Inc., Mountain View, California, 
USA). 

Figure 9 shows immunofluorescent detection of CD20 expression 
during B cell development. Single cell suspensions of leukocytes were 
isolated from wild-type mice, stained using MB20-13 (visualized using a PE- 

25 conjugated, anti-mouse lgG3 antibody) and anti-B220 (FITC-conjugated) 
monoclonal antibodies, and examined by two-color immunofluorescence 
staining with flow cytometry analysis. Quadrant gates indicate negative and 
positive populations of cells as determined using isotype-matched control 
monoclonal antibodies. The gated cell populations correspond to the cells 

30 described in Table 7, and are shown for reference. These results are 
representative of those obtained with six (6) two month-old wild type mice. 
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Figure 10 summarizes the strategy for targeted disruption of the 
mouse CD20 gene. 

Figure 10A shows genomic clones encoding CD20. 

Figure 10B shows the intron-exon organization of the wild typeCD20 
5 allele containing exons 5-8 (shaded squares). 

Figure 10C shows the structure of the CD20 targeting vector. 

Figure 10D shows the predicted structure of the CD20 allele after 
gene targeting in ES cells by homologous recombination. The EcoR V 
restriction site in exon 6 is deleted as indicated. 
10 Figure 10E presents Southern blot analysis of tail DNA from two wild 

type and four CD20 _/ " mice. Genomic DNA was digested with EcoR V, 
transferred to nitrocellulose and hybridized with the 5' probe indicated in (D). 

Figure 10F shows PCR amplification of genomic DNA from wild type 
and CD20' A mice using primers that bind in exons 6 and 7. Amplification of 
15 glyceraldehyde-3-phosphate dehydrogenase (G3PDH) is shown as a 
positive control. 

Figure 10G shows PCR amplification of cDNA generated from splenic 
RNA of wild type and CD20 _/ " mice. Each reaction mixture contained a 
sense primer that hybridized with sequences encoded by exon 3 and 
20 antisense primers that hybridized with either exon 6 or Neo r gene promoter 
sequences. 

Figures 10H and 101 show reactivity of the MB20-13 monoclonal 
antibody with CD20 cDNA-transfected (thick line) or untransfected (dashed 
line) 300.19 cells (Figure 10H) or Chinese Hamster Ovary (CHO) cells 
25 (Figure 101). The thin lines represent CD20 cDNA-transfected cells stained 
with secondary antibody alone or an isotype-control monoclonal antibody. 
Indirect immunofluorescence staining was visualized by flow cytometry 
analysis. 

Figure 10J shows immunofluorescent staining of splenocytes from 
30 CD20" /_ or wild type mice with MB20-13 (visualized using a PE-conjugated, 
anti-mouse lgG3 antibody) and anti-B220 (FITC-conjugated) monoclonal 
antibodies. Splenocytes from CD20' A mice generated histograms identical to 
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those obtained without MB20-1 monoclonal antibody present, using the 
secondary antibody alone. 

Figure 11 depicts immunofluorescent detection of B lymphocyte 
subpopulations in CD20" A and wild type mice. Lymphocytes were isolated 
5 and examined by two color immunofluorescent staining with flow cytometry 
analysis. Quadrants delineated by squares indicate negative and positive 
populations of cells as determined using unreactive monoclonal antibody 
controls. The gated cell populations correspond to the cells described in 
Table 7 that represent at least 6 mice of each genotype. 

10 Figure 12 shows altered signal transduction in CD20~ / ~ B cells. Figure 

12 also shows CD19 expression by splenocytes from CD20' /_ (thin line) and 
wild type (thick line) mice. Immunofluorescence staining using PE- 
conjugated anti-CD19 monoclonal antibody with flow cytometry analysis. 
The dashed line represents staining of wild type splenocytes with a control 

15 antibody. 

Figure 12A presents calcium responses induced by BCR or CD19 
ligation in CD20 V " and wild type B cells. Splenocytes were loaded with 1 pM 
indo-1-AM ester and B cells were stained with FITC-conjugated anti-B220 
antibody. At 1 min (arrow), optimal concentrations of goat anti-IgM F(ab') 2 

20 antibody fragments, anti-CD19 monoclonal antibody or Thapsigargin were 
added, with or without EGTA present. Increased ratios of indo-1 
fluorescence indicate increased [Ca 2+ ]j. Results represent those from at 
least four experiments. 

Figure 12B presents assays of tyrosine phosphorylation of proteins 

25 from purified splenic B cells of CD20" /_ and wild type mice. B cells (2 x 
10 7 /sample) were incubated with anti-IgM antibody for the times shown and 
detergent lysed. Proteins were resolved by SDS-PAGE, transferred to 
nitrocellulose and immunoblotted with anti-phosphotyrosine (anti-PTyr) 
antibody. The blot was stripped and reprobed with anti-SHP-1 antibody as a 

30 control for equivalent protein loading. Western blots from two of three 
experiments are shown to demonstrate the range of results. 
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Detailed Description of the Invention 
The present invention provides isolated nucleic acids encoding MS4A 
polypeptides (representative embodiments set forth as the odd-numbered 
SEQ ID NOs:1-37), isolated MS4A polypeptides (representative 
5 embodiments set forth as the even-numbered SEQ ID NOs:2-38), and uses 
thereof. The disclosed MS4A nucleic acids and polypeptides can be used 
according to methods of the present invention for drug discovery screens, for 
therapeutic treatment of atopic conditions, and for therapeutic regulation of 
[Ca 2+ ]i levels, among other uses. 

10 

JL DEFINITIONS 

While the following terms are believed to be well understood by one 
of ordinary skill in the art, the following definitions are set forth to facilitate 
explanation of the invention. The entire contents of all publications 
15 mentioned herein, including the discussion of the background art presented 
above, are hereby fully incorporated by reference. 

I .A. MS4A nucleic acids 

The nucleic acid molecules provided by the present invention include 
the isolated nucleic acid molecules of any one of the odd-numbered SEQ ID 

20 NOs:1-37, sequences substantially similar to sequences of any one of the 
odd-numbered SEQ ID NOs:1-37, conservative variants thereof, 
subsequences and elongated sequences thereof, complementary DNA 
molecules, and corresponding RNA molecules. The present invention also 
encompasses genes, cDNAs, chimeric genes, and vectors comprising 

25 disclosed MS4A nucleic acid sequences. 

The term "nucleic acid molecule" refers to deoxyribonucleotides or 
ribonucleotides and polymers thereof in either single- or double-stranded 
form. Unless specifically limited, the term encompasses nucleic acids 
containing known analogues of natural nucleotides which have similar 

30 properties as the reference natural nucleic acid. Unless otherwise indicated, 
a particular nucleotide sequence also implicitly encompasses conservatively 
modified variants thereof (e.g. degenerate codon substitutions), 
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complementary sequences, subsequences, elongated sequences, as well as 
the sequence explicitly indicated. The terms "nucleic acid molecule" or 
"nucleotide sequence" can also be used in place of "gene", "cDNA", or 
"mRNA". Nucleic acids can be derived from any source, including any 
5 organism. 

The term "isolated", as used in the context of a nucleic acid molecule, 
indicates that the nucleic acid molecule exists apart from its native 
environment and is not a product of nature. An isolated DNA molecule can 
exist in a purified form or can exist in a non-native environment such as a 

1 0 transgenic host cell. 

The term "purified", when applied . to a nucleic acid, denotes that the 
nucleic acid is essentially free of other cellular components with which it is 
associated in the natural state. Preferably, a purified nucleic acid molecule 
is a homogeneous dry or aqueous solution. The term "purified" denotes that 

15 a nucleic acid or protein gives rise to essentially one band in an 
electrophoretic gel. Particularly, it means that the nucleic acid is at least 
about 50% pure, more preferably at least about 85% pure, and most 
preferably at least about 99% pure. 

The term "substantially identical", the context of two nucleotide or 

20 amino acid sequences, can also be defined as two or more sequences or 
subsequences that have at least 60%, preferably 80%, more preferably 90- 
95%, and most preferably at least 99% nucleotide or amino acid sequence 
identity, when compared and aligned for maximum correspondence, as 
measured using one of the following sequence comparison algorithms 

25 (described herein below under the heading Nucleotide and Amino Acid 
Sequence Comparisons ) or by visual inspection. Preferably, the substantial 
identity exists in nucleotide sequences of at least 50 residues, more 
preferably in nucleotide sequence of at least about 100 residues, more 
preferably in nucleotide sequences of at least about 150 residues, and most 

30 preferably in nucleotide sequences comprising complete coding sequences. 
In one aspect; polymorphic sequences can be substantially identical 
sequences. The term "polymorphic" refers to the occurrence of two or more 
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genetically determined alternative sequences or alleles in a population. An 
allelic difference can be as small as one base pair. 

Another indication that two nucleotide sequences are substantially 
identical is that the two molecules specifically or substantially hybridize to 
5 each other under stringent conditions. In the context of nucleic acid 
hybridization, two nucleic acid sequences being compared can be 
designated a "probe" and a "target". A "probe" is a reference nucleic acid 
molecule, and a "'target" is a test nucleic acid molecule, often found within a 
heterogenous population of nucleic acid molecules. A "target sequence" is 

1 0 synonymous with a "test sequence". 

A preferred nucleotide sequence employed for hybridization studies 
or assays includes probe sequences that are complementary to or mimic at 
least an about 14 to 40 nucleotide sequence of a nucleic acid molecule of 
the present invention. Preferably, probes comprise 14 to 20 nucleotides, or 

15 even longer where desired, such as 30, 40, 50, 60, 100, 200, 300, or 500 
nucleotides or up to the full length of any of those set forth as the odd- 
numbered SEQ ID NOs:1-37. Such fragments can be readily prepared by, 
for example, directly synthesizing the fragment by chemical synthesis, by 
application of nucleic acid amplification technology, or by introducing 

20 selected sequences into recombinant vectors for recombinant production. 

The phrase "hybridizing specifically to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide 
sequence under stringent conditions when that sequence is present in a 
complex nucleic acid mixture (e.g., total cellular DNA or RNA). The phrase 

25 "binds substantially to" refers to complementary hybridization between a 
probe nucleic acid molecule and a target nucleic acid molecule and 
embraces minor mismatches that can be accommodated by reducing the 
stringency of the hybridization media to achieve the desired hybridization. 

"Stringent hybridization conditions" and "stringent hybridization wash 

30 conditions" in the context of nucleic acid hybridization experiments such as 
Southern and Northern blot analysis are both sequence- and environment- 
dependent. Longer sequences hybridize specifically at higher temperatures. 
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An extensive guide to the hybridization of nucleic acids is found in Tijssen 
(1993) Laboratory Techniques in Biochemistry and Molecular Biology- 
Hybridization with Nucleic Acid Probes , part I chapter 2, Elsevier, New York, 
New York. Generally, highly stringent hybridization and wash conditions are 
5 selected to be about 5°C lower than the thermal melting point (T m ) for the 
specific sequence at a defined ionic strength and pH. Typically, under 
"stringent conditions" a probe will hybridize specifically to its target 
subsequence, but to no other sequences. 

The T m is the temperature (under defined ionic strength and pH) at 

10 which 50% of the target sequence hybridizes to a perfectly matched probe. 
Very stringent conditions are selected to be equal to the T m for a particular 
probe. An example of stringent hybridization conditions for Southern or 
Northern Blot analysis of complementary nucleic acids having more than 
about 100 complementary residues is overnight hybridization in 50% 

15 formamide with 1 mg of heparin at 42°C. An example of highly stringent 
wash conditions is 15 minutes in 0.1 5 M NaCI at 65°C. An example of 
stringent wash conditions is 15 minutes in 0.2X SSC buffer at 65°C ( See 
Sambrook et al. eds. (1989) Molecular Cloning: A Laboratory Manual . Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York for a 

20 description of SSC buffer). Often, a high stringency wash is preceded by a 
low stringency wash to remove background probe signal. An example of 
medium stringency wash conditions for a duplex of more than about 100 
nucleotides, is 15 minutes in 1X SSC at 45°C. An example of low stringency 
wash for a duplex of more than about 100 nucleotides, is 15 minutes in 4-6X 

25 SSC at 40°C. For short probes (e.g., about 10 to 50 nucleotides), stringent 
conditions typically involve salt concentrations of less than about 1.0 M Na + 
ion, typically about 0.01 to 1.0 M Na + ion concentration (or other salts) at pH 
7.0-8.3, and the temperature is typically at least about 30°C. Stringent 
conditions can also be achieved with the addition of destabilizing agents 

30 such as formamide. In general, a signal to noise ratio of 2-fold (or higher) 
than that observed for an unrelated probe in the particular hybridization 
assay indicates detection of a specific hybridization. 
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The following are examples of hybridization and wash conditions that 
can be used to • clone homologous nucleotide sequences that are 
substantially identical to reference nucleotide sequences of the present 
invention: a probe nucleotide sequence preferably hybridizes to a target 
5 nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 
mM EDTA at 50°C followed by washing in 2X SSC, 0.1% SDS at 50°C; more 
preferably, a probe and target sequence hybridize in 7% sodium dodecyl 
sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C followed by washing in 1X 
SSC, 0.1% SDS at 50°C; more preferably, a probe and target sequence 

10 hybridize in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 
50°C followed by washing in 0.5X SSC, 0.1% SDS at 50°C; more preferably, 
a probe and target sequence hybridize in 7% sodium dodecyl sulfate (SDS), 
0.5 M NaP0 4 , 1 mM EDTA at 50°C followed by washing in 0.1X SSC, 0.1% 
SDS at 50°C; more preferably, a probe and target sequence hybridize in 7% 

15 sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C followed 
by washing in 0.1X SSC, 0.1% SDS at 65°C. 

A further indication that two nucleic acid sequences are substantially 
identical is that proteins encoded by the nucleic acids are substantially 
identical, share an overall three-dimensional structure, are biologically 

20 functional equivalents, or are immunologically cross-reactive. These terms 
are defined further under the heading MS4A Polypeptides herein below. 
Nucleic acid molecules that do not hybridize to each other under stringent 
conditions are still substantially identical if the corresponding proteins are 
substantially identical. This can occur, for example, when two nucleotide 

25 sequences are significantly degenerate as permitted by the genetic code. 

The term "conservatively substituted variants" refers to nucleic acid 
sequences having degenerate codon substitutions wherein the third position 
of one or more selected (or all) codons is substituted with mixed-base and/or 
deoxyinosine residues (Batzer et al. (1991) Nucleic Acids Res 19:5081; 

30 Ohtsuka et al. (1985) J Biol Chem 260:2605-2608; Rossolini et al. (1994) 
Mol Cell Probes 8:91 -98). 
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The term "subsequence" refers to a sequence of nucleic acids that 
comprises a part of a longer nucleic acid sequence. An exemplary 
subsequence is a probe, described herein above, or a primer. The term 
"primer" as used herein refers to a contiguous sequence comprising about 8 
5 or more deoxyribonucleotides or ribonucleotides, preferably 10-20 
nucleotides, and more preferably 20-30 nucleotides of a selected nucleic 
acid molecule. The primers of the invention encompass oligonucleotides of 
sufficient length and appropriate sequence so as to provide initiation of 
polymerization on a nucleic acid molecule of the present invention. 

10 The term "elongated sequence" refers to an addition of nucleotides 

(or other analogous molecules) incorporated into the nucleic acid. For 
example, a polymerase (e.g., a DNA polymerase), e.g., a polymerase which 
adds sequences at the 3' terminus of the nucleic acid molecule. In addition, 
the nucleotide sequence can be combined with other DNA sequences, such 

15 as promoters, promoter regions, enhancers, polyadenylation signals, intronic 
sequences, additional restriction enzyme sites, multiple cloning sites, and 
other coding segments. 

The term "complementary sequence", as used herein, indicates two 
nucleotide sequences that comprise antiparallel nucleotide sequences 

20 capable of pairing with one another upon formation of hydrogen bonds 
between base pairs. As used herein, the term "complementary sequences" 
means nucleotide sequences which are substantially complementary, as can 
be assessed by the same nucleotide comparison set forth above, or is 
defined as being capable of hybridizing to the nucleic acid segment in 

25 question under relatively stringent conditions such as those described 
herein. A particular example of a complementary nucleic acid segment is an 
antisense oligonucleotide. 

The term "gene" refers broadly to any segment of DNA associated 
with a biological function. A gene encompasses sequences including but not 

30 limited to a coding sequence, a promoter region, a cis-regulatory sequence, 
a non-expressed DNA segment is a specific recognition sequence for 
regulatory proteins, a non-expressed DNA segment that contributes to gene 
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expression, a DNA segment designed to have desired parameters, or 
combinations thereof. A gene can be obtained by a variety of methods, 
including cloning from a biological sample, synthesis based on known or 
predicted sequence information, and recombinant derivation of an existing 
5 sequence. 

The term "gene expression" generally refers to the cellular processes 
by which a biologically active polypeptide is produced from a DNA 
sequence. 

The present invention also encompasses chimeric genes comprising 

10 the disclosed MS4A sequences. The term "chimeric gene", as used herein, 
refers to a promoter region operably linked to a MS4A coding sequence, a 
nucleotide sequence producing an antisense RNA molecule, a RNA 
molecule having tertiary structure, such as a hairpin structure, or a double- 
stranded RNA molecule. 

15 The term "operably linked", as used herein, refers to a promoter 

region that is connected to a nucleotide sequence in such a way that the 
transcription of that nucleotide sequence is controlled and regulated by that 
promoter region. Techniques for operatively linking a promoter region to a 
nucleotide sequence are well known in the art. 

20 The terms "heterologous gene", "heterologous DNA sequence", 

"heterologous nucleotide sequence", "exogenous nucleic acid molecule", or 
"exogenous DNA segment", as used herein, each refer to a sequence that 
originates from a source foreign to an intended host cell or, if from the same 
source, is modified from its original form. Thus, a heterologous gene in a 

25 host cell includes a gene that is endogenous to the particular host cell but 
has been modified, for example by mutagenesis or by isolation from native 
cis-regulatory sequences. The terms also include non-naturally occurring 
multiple copies of a naturally occurring nucleotide sequence. Thus, the 
terms refer to a DNA segment that is foreign or heterologous to the cell, or 

30 homologous to the cell but in a position within the host cell nucleic acid 
wherein the element is not ordinarily found. 

The term "promoter region" defines a nucleotide sequence within a 
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gene that is positioned 5' to a coding sequence of a same gene and 
functions to direct transcription of the coding sequence. The promoter 
region includes a transcriptional start site and at least one cis-regulatory 
element. The present invention encompasses nucleic acid sequences that 
5 comprise a promoter region of a MS4A gene, or functional portion thereof. 

The term "cis-acting regulatory sequence" or "cis-regulatory motif or 
"response element", as used herein, each refer to a nucleotide sequence 
that enables responsiveness to a regulatory transcription factor. 
Responsiveness can encompass a decrease or an increase in 

10 transcriptional output and is mediated by binding of the transcription factor to 
the DNA molecule comprising the response element. 

The term "transcription factor" generally refers to a protein that 
modulates gene expression by interaction with the cis-regulatory element 
and cellular components for transcription, including RNA Polymerase, 

15 • Transcription Associated Factors (TAFs), chromatin-remodeling proteins, 
and any other relevant protein that impacts gene transcription. 

A "functional portion" of a promoter gene fragment is a nucleotide 
sequence within a promoter region that is required for normal gene 
transcription. To determine nucleotide sequences that are functional, the 

20 expression of a reporter gene is assayed when variably placed under the 
direction of a promoter region fragment. 

Promoter region fragments can be conveniently made by enzymatic 
digestion of a larger fragment using restriction endonucleases or DNAse I. 
Preferably, a functional promoter region fragment comprises about 5000 

25 nucleotides, more preferably 2000 nucleotides, more preferably about 1000 
nucleotides. Even more preferably a functional promoter region fragment 
comprises about 500 nucleotides, even more preferably a functional 
promoter region fragment comprises about 100 nucleotides, and even more 
preferably a functional promoter region fragment comprises about 20 

30 nucleotides. 

The terms "reporter gene" or "marker gene" or "selectable marker" 
each refer to a heterologous gene encoding a product that is readily 
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observed and/or quantitated. A reporter gene is heterologous in that it 
originates from a source foreign to an intended host cell or, if from the same 
source, is modified from its original form. Non-limiting examples of 
detectable reporter genes that can be operably linked to a transcriptional 
5 regulatory region can be found in Alam & Cook (1990) Anal Biochem 
188:245-254 and PCT International Publication No. WO 97/47763. 
Preferred reporter genes for transcriptional analyses include the lacZ gene 
( See , e.g., Rose & Botstein (1983) Meth Enzymol 101:167-180), Green 
Fluorescent Protein (GFP) (Cubitt et al. (1995) Trends Biochem Sci 20:448- 

10 455), luciferase, or chloramphenicol acetyl transferase (CAT). Preferred 
reporter genes for methods to produce transgenic animals include but are 
not limited to antibiotic resistance genes, and more preferably the antibiotic 
resistance gene confers neomycin resistance. Any suitable reporter and 
detection method can be used, and it will be appreciated by one of skill in 

15 the art that no particular choice is essential to or a limitation of the present 
invention. 

An amount of reporter gene can be assayed by any method for 
qualitatively or preferably, quantitatively determining presence or activity of 
the reporter gene product. The amount of reporter gene expression directed 

20 by each test promoter region fragment is compared to an amount of reporter 
gene expression to a control construct comprising the reporter gene in the 
absence of a promoter region fragment. A promoter region fragment is 
identified as having promoter activity when there is significant increase in an 
amount of reporter gene expression in a test construct as compared to a 

25 control construct. The term "significant increase", as used herein, refers to 
an quantified change in a measurable quality that is larger than the margin of 
error inherent in the measurement technique, preferably an increase by 
about 2-fold or greater relative to a control measurement, more preferably an 
increase by about 5-fold or greater, and most preferably an increase by 

30 about 10-fold or greater. 

The present invention further includes vectors comprising the 
disclosed MS4A sequences, including plasmids, cosmids, and viral vectors. 
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The term "vector", as used herein refers to a DNA molecule having 
sequences that enable its replication in a compatible host cell. A vector also 
includes nucleotide sequences to permit ligation of nucleotide sequences 
within the vector, wherein such nucleotide sequences are also replicated in a 
5 compatible host cell. A vector can also mediate recombinant production of a 
MS4A polypeptide, as described further herein below. Preferred vectors 
include but are not limited to pBluescript (Stratagene), pUC18, pBLCAT3 
(Luckow & Schutz (1987) Nucleic Acids Res 15:5490), pLNTK (Gorman et 
al. (1996) Immunity 5:241-252), and pBAD/glll (Stratagene). A preferred 

10 host cell is a mammalian cell; more preferably the cell is a Chinese hamster 
ovary cell, a HeLa cell, a baby hamster kidney cell, or a mouse cell; even 
more preferably the cell is a human cell. 

Nucleic acids of the present invention can be cloned, synthesized, 
recombinantly altered, mutagenized, or combinations thereof. Standard 

15 recombinant DNA and molecular cloning techniques used to isolate nucleic 
acids are well known in the art. Exemplary, non-limiting methods are 
described by Sambrook et al., eds. (1989); by Silhavy et al. (1984) 
Experiments with Gene Fusions , Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York; by Ausubel et al. (1992) Current Protocols in 

20 Molecular Biology , John Wylie and Sons, Inc., New York, New York; and by 
Glover, ed. (1985) DNA Cloning: A Practical Approach , MRL Press, Ltd., 
Oxford, United Kingdom. Site-specific mutagenesis to create base pair 
changes, deletions, or small insertions are also well known in the art as 
exemplified by publications, see, e.g., Adelman et al., (1983) DAW 2:183; 

25 Sambrook et al. (1 989). 

Sequences detected by methods of the invention can be detected, 
subcloned, sequenced, and further evaluated by any measure well known in 
the art using any method usually applied to the detection of a specific DNA 
sequence including but not limited to dideoxy sequencing, PCR, oligomer 

30 restriction (Saiki et al. (1985) Bio/Technology 3:1008-1012), allele-specific 
oligonucleotide (ASO) probe analysis (Conner et al. (1983) Proc Natl Acad 
Sci USA 80:278), and oligonucleotide ligation assays (OLAs) (Landgren et. 
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al. (1988) Science 241:1007). Molecular techniques for DNA analysis have 
been reviewed (Landgren et. al. (1988) Science 242:229-237). 
I.B. MS4A Polypeptides 

The polypeptides provided by the present invention include the 
5 isolated polypeptides set forth as the even-numbered SEQ ID NOs:2-38, 
polypeptides substantially similar to the even-numbered SEQ ID NOs:2-38, 
MS4A polypeptide fragments, fusion proteins comprising MS4A amino acid 
sequences, biologically functional analogs, and polypeptides that cross-react 
with an antibody that specifically recognizes a MS4A polypeptide. 
10 The term "isolated", as used in the context of a polypeptide, indicates 

that the polypeptide exists apart from its native environment and is not a 
product of nature. An isolated polypeptide can exist in a purified form or can 
exist in a non-native environment such as, for example, in a transgenic host 
cell. 

15 The term "purified", when applied to a polypeptide, denotes that the 

polypeptide is essentially free of other cellular components with which it is 
associated in the natural state. Preferably, a polypeptide is a homogeneous 
solid or aqueous solution. Purity and homogeneity are typically determined 
using analytical chemistry techniques such as polyacrylamide gel 

20 electrophoresis or high performance liquid chromatography. A polypeptide 
which is the predominant species present in a preparation is substantially 
purified. The term "purified" denotes that a polypeptide gives rise to 
essentially one band in an electrophoretic gel. Particularly, it means that the 
polypeptide is at least about 50% pure, more preferably at least about 85% 

25 pure, and most preferably at least about 99% pure. 

The term "substantially identical" in the context of two or more 
polypeptides sequences is measured by (a) polypeptide sequences having 
about 35%, or 45%, or preferably from 45-55%, or more preferably 55-65%, 
or most preferably 65% or greater amino acids which are identical or 

30 functionally equivalent. Percent "identity" and methods for determining 
identity are defined herein below under the heading Nucleotide and Amino 
Acid Sequence Comparisons . 
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Substantially identical polypeptides also encompass two or more 
polypeptides sharing a conserved three-dimensional structure. 
Computational methods can be used to compare structural representations, 
and structural models can be generated and easily tuned to identify 
5 similarities around important active sites or ligand binding sites. See 
Henikoff et al. (2000) Electrophoresis 21 (9):1 700-1 706; Huang et al. (2000) 
Pac Symp Biocomput 230-241; Saqi et al. (1999) Bioinformatics 15(6):521- 
522; and Barton (1 998) Acta Crystallogr D Biol Crystallogr 54:1 139-1 146. 

The term "functionally equivalent" in the context of amino acid 

10 sequences is well known in the art and is based on the relative similarity of 
the amino acid side-chain substituents. See Henikoff & Henikoff (2000) Adv 
Protein Chem 54:73-97. Relevant factors for consideration include side- 
chain hydrophobicity, hydrophilicity, charge, and size. For example, 
arginine, lysine, and histidine are all positively charged residues; that 

15 alanine, glycine, and serine are all of similar size; and that phenylalanine, 
tryptophan, and tyrosine all have a generally similar shape. By this analysis, 
described further herein below, arginine, lysine, and histidine; alanine, 
glycine, and serine; and phenylalanine, tryptophan, and tyrosine; are defined 
herein as biologically functional equivalents. 

20 In making biologically functional equivalent amino acid substitutions, 

the hydropathic index of amino acids can be considered. Each amino acid 
has been assigned a hydropathic index on the basis of their hydrophobicity 
and charge characteristics, these are: isoleucine (+ 4.5); valine (+ 4.2); 
leucine (+ 3.8); phenylalanine (+ 2.8); cysteine (+ 2.5); methionine (+ 1.9); 

25 alanine (+ 1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (- 
0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and 
arginine (-4.5). 

The importance of the hydropathic amino acid index in conferring 
30 interactive biological function on a protein is generally understood in the art 
(Kyte et al. (1982) J Mol Biol 157:105.). It is known that certain amino acids 
can be substituted for other amino acids having a similar hydropathic index 
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or score and still retain a similar biological activity. In making changes based 
upon the hydropathic index, the substitution of amino acids whose 
hydropathic indices are within ±2 of the original value is preferred, those 
which are within ±1 of the original value are particularly preferred, and those 
5 within ±0.5 of the original value are even more particularly preferred. 

It is also understood in the art that the substitution of like amino acids 
can be made effectively on the basis of hydrophilicity. U.S. Patent No. 
4,554,101 states that the greatest local average hydrophilicity of a protein, 
as governed by the hydrophilicity of its adjacent amino acids, correlates with 
10 its immunogenicity and antigenicity, i.e. with a biological property of the 
protein. It is understood that an amino acid can be substituted for another 
having a similar hydrophilicity value and still obtain a biologically equivalent 
protein. 

As detailed in U.S. Patent No. 4,554,101, the following hydrophilicity 
15 values have been assigned to amino acid residues: arginine (+ 3.0); lysine 
(+ 3.0); aspartate (+ 3.0±1); glutamate (+ 3.0±1); serine (+ 0.3); asparagine 
(+ 0.2); glutamine (+ 0.2); glycine (0); threonine (-0.4); proline (-0.5±1); 
alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); 
leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); 
20 tryptophan (-3.4) . 

In making changes based upon similar hydrophilicity values, the 
substitution of amino acids whose hydrophilicity values are within ±2 of the 
original value is preferred, those which are within ±1 of the original value are 
particularly preferred, and those within ±0.5 of the original value are even 
25 more particularly preferred. 

The present invention also encompasses MS4A polypeptide 
fragments or functional portions of a MS4A polypeptide. Such functional 
portion need not comprise all or substantially all of the amino acid sequence 
of a native MS4A gene product. The term "functional" includes any 
30 biological activity or feature of MS4A, including immunogenicity. 

The present invention also includes longer sequences of a MS4A 
polypeptide, or portion thereof. For example, one or more amino acids can 
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be added to the N-terminus or C-terminus of a MS4A polypeptide. Fusion 
proteins comprising MS4A polypeptide sequences are also provided within 
the scope of the present invention. Methods of preparing such proteins are 
known in the art. 

5 The present invention also encompasses functional analogs of a 

MS4A polypeptide. Functional analogs share at least one biological function 
with a MS4A polypeptide. An exemplary function is immunogenicity. In the 
context of amino acid sequence, biologically functional analogs, as used 
herein, are peptides in which certain, but not most or all, of the amino acids 

10 can be substituted. Functional analogs can be created at the level of the 
corresponding nucleic acid molecule, altering such sequence to encode 
desired amino acid changes. In one embodiment, changes can be 
introduced to improve the antigenicity of the protein. In another 
embodiment, a MS4A polypeptide sequence is varied so as to assess the 

1 5 activity of a mutant MS4A polypeptide. 

The present invention also encompasses recombinant production of 
the disclosed MS4A polypeptides. Briefly, a nucleic acid sequence encoding 
a MS4A polypeptide, or portion thereof, is cloned into a expression cassette, 
the cassette is introduced into a host organism, where it is recombinantly 

20 produced. 

The term "expression cassette" as used herein 'means a DNA 
sequence capable of directing expression of a particular nucleotide 
sequence in an appropriate host cell, comprising a promoter operably linked 
to the nucleotide sequence of interest which is operably linked to termination 

25 signals. It also typically comprises sequences required for proper translation 
of the nucleotide sequence. The expression cassette comprising the 
nucleotide sequence of interest can be chimeric. The expression cassette 
can also be one which is naturally occurring but has been obtained in a 
recombinant form useful for heterologous expression. 

30 The expression of the nucleotide sequence in the expression cassette 

can be under the control of a constitutive promoter or an inducible promoter 
which initiates transcription only when the host cell is exposed to some 
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particular external stimulus. Exemplary promoters include Simian virus 40 
early promoter, a long terminal repeat promoter from retrovirus, an action 
promoter, a heat shock promoter, and a metallothien protein. In the case of 
a multicellular organism, the promoter and promoter region can direct 
5 expression to a particular tissue or organ or stage of development. 
Exemplary tissue-specific promoter regions include a MS4A promoter, 
described herein. Suitable expression vectors which can be used include, 
but are not limited to, the following vectors or their derivatives: human or 
animal viruses such as vaccinia virus or adenovirus, yeast vectors, 
10 bacteriophage vectors (e.g., lambda phage), and plasmid and cosmids DNA 
vectors. 

The term "host cell", as used herein, refers to a cell into which a 
heterologous nucleic acid molecule has been introduced. Transformed cells, 
tissues, or organisms are understood to encompass not only the end product 
15 of a transformation process, but also transgenic progeny thereof. 

A host cell strain can be chosen which modulates the expression of 
the inserted sequences, or modifies and processes the gene product in the 
specific fashion desired. For example, different host cells have 
characteristic and specific mechanisms for the translational and post- 
20 transactional processing and modification (e.g., glycosylation, 
phosphorylation of proteins). Appropriate cell lines or host systems can be 
chosen to ensure the desired modification and processing of the foreign 
protein expressed. Expression in a bacterial system can be used to produce 
a non-glycosylated core protein product. Expression in yeast will produce a 
25 glycosylated product. Expression in animal cells can be used to ensure 
"native" glycosylation of a heterologous protein. 

Expression constructs are transfected into a host cell by any standard 
method, including electroporation, calcium phosphate precipitation, DEAE- 
Dextran transfection, liposome-mediated transfection, and infection using a 
30 retrovirus. The MS4A-encoding nucleotide sequence carried in the 
expression construct can be stably integrated into the genome of the host or 
it can be present as an extrachromosomal molecule. 
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Isolated polypeptides and recombinantly produced polypeptides can 
be purified and characterized using a variety of standard techniques that are 
well known to the skilled artisan. See, e.g. Ausubel et al. (1992), 
Bodanszky, et al. (1976) Peptide Synthesis , John Wiley and Sons, Second 
5 Edition, New York, New York and Zimmer et al. (1993) Peptides, pp. 393- 
394, ESCOM Science Publishers, B. V. 

I.C. Nucleotide and Amino Acid Sequence Comparisons 
The terms "identical" or percent "identity" in the context of two or more 
nucleotide or polypeptide sequences, refer to two or more sequences or 

10 subsequences that are the same or have a specified percentage of amino 
acid residues or nucleotides that are the same, when compared and aligned 
for maximum correspondence, as measured using one of the sequence 
comparison algorithms disclosed herein or by visual inspection.. 

The term "substantially identical" in regards to a nucleotide or 

15 polypeptide sequence means that a particular sequence varies from the 
sequence of a naturally occurring sequence by one or more deletions, 
substitutions, or additions, the net effect of which is to retain at least some of 
biological activity of the natural gene, gene product, or sequence. Such 
sequences include "mutant" sequences, or sequences wherein the biological 

20 activity is altered to some degree but retains at least some of the original 
biological activity. The term "naturally occurring", as used herein, is used to 
describe a composition that can be found in nature as distinct from being 
artificially produced by man. For example, a protein or nucleotide sequence 
present in an organism, which can be isolated from a source in nature and 

25 which has not been intentionally modified by man in the laboratory, is 
naturally occurring. 

For sequence comparison, typically one sequence acts as a 
reference sequence to which test sequences are compared. When using a 
sequence comparison algorithm, test and reference sequences are entered 

30 into a computer program, subsequence coordinates are designated if 
necessary, and sequence algorithm program parameters are selected. The 
sequence comparison algorithm then calculates the percent sequence 
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identity for the designated test sequence(s) relative to the reference 
sequence, based on the selected program parameters. 

Optimal alignment of sequences for comparison can be conducted, 
e.g., by the local homology algorithm of Smith & Waterman (1981) Adv Appl 
5 Math 2:482, by the homology alignment algorithm of Needleman & Wunsch 
(1970) J Mol Biol 48:443, by the search for similarity method of Pearson & 
Lipman (1988) Proc Natl Acad Sci USA 85:2444-2448, by computerized 
implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA 
in the Wisconsin Genetics Software Package, Genetics Computer Group, 

10 Madison, Wl), or by visual inspection. See generally, Ausubel et al., 1992. 

A preferred algorithm for determining percent sequence identity and 
sequence similarity is the BLAST algorithm, which is described in Altschul et 
al. (1990) J Mol Biol 215: 403-410. Software for performing BLAST analyses 
is publicly available through the National Center for Biotechnology 

15 Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first 
identifying high scoring sequence pairs (HSPs) by identifying short words of 
length W in the query sequence, which either match or satisfy some positive- 
valued threshold score T when aligned with a word of the same length in a 
database sequence. T is referred to as the neighborhood word score 

20 threshold. These initial neighborhood word hits act as seeds for initiating 
searches to find longer HSPs containing them. The word hits are then 
extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are 
calculated using, for nucleotide sequences, the parameters M (reward score 

25 for a pair of matching residues; always > 0) and N (penalty score for 
mismatching residues; always < 0). For amino acid sequences, a scoring 
matrix is used to calculate the cumulative score. Extension of the word hits 
in each direction are halted when the cumulative alignment score falls off by 
the quantity X from its maximum achieved value, the cumulative score goes 

30 to zero or below due to the accumulation of one or more negative-scoring 
residue alignments, or the end of either sequence is reached. The BLAST 
algorithm parameters W, T, and X determine the sensitivity and speed of the 
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alignment. The BLASTN program (for nucleotide sequences) uses as 
defaults a wordlength W=11, an expectation E=10, a cutoff of 100, M=5, N=- 
4, and a comparison of both strands. For amino acid sequences, the 
BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) 
5 of 10, and the BLOSUM62 scoring matrix. See Henikoff & Henikoff (1989) 
Proc Natl Acad Sci USA 89:10915. 

In addition to calculating percent sequence identity, the BLAST 
algorithm also performs a statistical analysis of the similarity between two 
sequences. See, e.g., Karlin and Altschul (1993) Proc Natl Acad Sci USA 

10 90:5873-5887. One measure of similarity provided by the BLAST algorithm 
is the smallest sum probability (P(N)), which provides an indication of the 
probability by which a match between two nucleotide or amino acid 
sequences would occur by chance. For example, a test nucleic acid 
sequence is considered similar to a reference sequence if the smallest sum 

1 5 probability in a comparison of the test nucleic acid sequence to the reference 
nucleic acid sequence is less than about 0.1, more preferably less than 
about 0.01 , and most preferably less than about 0.001 . 
I.D. Antibodies 

The present invention also provides an antibody that specifically binds 
20 a MS4A polypeptide. The term "antibody" indicates an immunoglobulin 
protein, or functional portion thereof, including a polyclonal antibody, a 
monoclonal antibody, a chimeric antibody, a single chain antibody, Fab 
fragments, and an Fab expression library. "Functional portion" refers to the 
part of the protein that binds a molecule of interest. In a preferred 
25 embodiment, an antibody of the invention is a monoclonal antibody. 
Techniques for preparing and characterizing antibodies are well known in 
the art ( See , e.g., Harlow & Lane (1988) Antibodies: A Laboratory Manual . 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York). A 
monoclonal antibody of the present invention can be readily prepared 
30 through use of well-known techniques such as the hybridoma techniques 
exemplified in U.S. Patent No 4,196,265 and the phage-displayed 
techniques disclosed in U.S. Patent No. 5,260,203. 
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The phrase "specifically (or selectively) binds to an antibody", or 
"specifically (or selectively) immunoreactive with", when referring to a protein 
or peptide, refers to a binding reaction which is determinative of the 
presence of the protein in a heterogeneous population of proteins and other 
5 biological materials. Thus, under designated immunoassay conditions, the 
specified antibodies bind to a particular protein and do not show significant 
binding to other proteins present in the sample. Specific binding to an 
antibody under such conditions can require an antibody that is selected for 
its specificity for a particular protein. For example, antibodies raised to a 

10 protein with an amino acid sequence encoded by any of the nucleic acid 
sequences of the invention can be selected to obtain antibodies specifically 
immunoreactive with that protein and not with unrelated proteins. 

The use of a molecular cloning approach to generate antibodies, 
particularly monoclonal antibodies, and more particularly single chain 

15 monoclonal antibodies, are also provided. The production of single chain 
antibodies has been described in the art. See , e.g., U.S. Patent No. 
5,260,203. For this approach, combinatorial immunoglobulin phagemid 
libraries are prepared from RNA isolated from the spleen of the immunized 
animal, and phagemids expressing appropriate antibodies are selected by 

20 panning on endothelial tissue. The advantages of this approach over 
conventional hybridoma techniques are that approximately 10 4 times as 
many antibodies can be produced and screened in a single round, and that 
new specificities are generated by heavy (H) and light (L) chain 
combinations in a single chain, which further increases the chance of finding 

25 appropriate antibodies. Thus, an antibody of the present invention, or a 
"derivative" of an antibody of the present invention, pertains to a single 
polypeptide chain binding molecule which has binding specificity and affinity 
substantially similar to the binding specificity and affinity of the light and 
heavy chain aggregate variable region of an antibody described herein. 

30 The term "immunochemical reaction", as used herein, refers to any of 

a variety of immunoassay formats used to detect antibodies specifically 
bound to a particular protein, including but not limited to competitive and 
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non-competitive assay systems using techniques such as 
radioimmunoassays, ELISA (enzyme linked immunosorbent assay), 
"sandwich" immunoassays, immunoradiometric assays, gel diffusion 
precipitation reactions, immunodiffusion assays, in situ immunoassays (e.g., 
5 using colloidal gold, enzyme or radioisotope labels), western blots, 
precipitation reactions, agglutination assays (e.g., gel agglutination assays, 
hemagglutination assays), complement fixation assays, immunofluorescence 
assays, protein A assays, and Immunoelectrophoresis assays, etc. See 
Harlow & Lane (1988) for a description of immunoassay formats and 
10 conditions. 

I.E. Protein Binding Assays 

The term "binding" refers to an affinity between two molecules, for 
example, a ligand and a receptor. As used herein, "binding" means a 
preferential binding of one molecule for another in a mixture of molecules. 

15 The binding of the molecules can be considered specific if the binding affinity 
is about 1 x 10 4 M" 1 to about 1 x 10 6 M' 1 or greater. Binding of two 
molecules also encompasses a quality or state of mutual action such that an 
activity of one protein or compound on another protein is inhibitory (in the 
case of an antagonist) or enhancing (in the case of an agonist). Exemplary 

20 protein binding assays include but are not limited to Fluorescence 
Correlation Spectroscopy (FCS), Surface-Enhanced Laser 
Desorption/lonization time-of-flight mass spectrometry (SELDI-TOF), and 
Biacore, each described further herein below. 

Fluorescence Correlation Spectroscopy (FCS) measures the average 

25 diffusion rate of a fluorescent molecule within a small sample volume 
(Madge et al. (1972) Phys Rev Lett 29:705-708; Maiti et al. (1997) Proc Natl 
Acad Sci USA, 94:11753-11757). The sample size can be as low as 10 3 
fluorescent molecules and the sample volume as low as the cytoplasm of a 
single bacterium. The diffusion rate is a function of the mass of the molecule 

30 and decreases as the mass increases. FCS can therefore be applied to 
protein-ligand interaction analysis by measuring the change in mass and 
therefore in diffusion rate of a molecule upon binding. In a typical 
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experiment, the target to be analyzed is expressed as a recombinant protein 
with a sequence tag, such as a poly-histidine sequence, inserted at the N- 
terminus or C-terminus. The expression takes place in E. coli, yeast or 
mammalian cells. The protein is purified using chromatographic methods. 
5 For example, the poly-histidine tag can be used to bind the expressed 
protein to a metal chelate column such as Ni 2+ chelated on iminodiacetic 
acid agarose. The protein is then labeled with a fluorescent tag such as 
carboxytetramethylrhodamine or BODIPY™ (Molecular Probes, Eugene, 
Oregon). The protein is then exposed in solution to the potential ligand, and 

10 its diffusion rate is determined by FCS using instrumentation available from 
Carl Zeiss, Inc. (Thornwood, New York). Ligand binding is determined by 
changes in the diffusion rate of the protein. 

. Surface-Enhanced Laser Desorption/lonization (SELDI) was 
developed by Hutchens & Yip (1993) Rapid Commun Mass Spectrom 7:576- 

15 580). When coupled to a time-of-flight mass spectrometer (TOF), SELDI 
provides a means to rapidly analyze molecules retained on a chip. It can be 
applied to ligand-protein interaction analysis by covalently binding the target 
protein on the chip and analyzing by MS the small molecules that bind to this 
protein (Worrall et al. (1998) Anal Biochem 70:750-756). In a typical 

20 experiment, the target to be analyzed is expressed as described for FCS. 
The purified protein is then used in the assay without further preparation. It 
is bound to the SELDI chip either by utilizing the poly-histidine tag or by 
other interaction such as ion exchange or hydrophobic interaction. The chip 
thus prepared is then exposed to the potential ligand via, for example, a 

25 delivery system able to pipet the ligands in a sequential manner 
(autosampler). The chip is then washed in solutions of increasing 
stringency, for example a series of washes with buffer solutions containing 
an increasing ionic strength. After each wash, the bound material is 
analyzed by submitting the chip to SELDI-TOF. Ligands that specifically 

30 bind the target are identified by the stringency of the wash needed to elute 
them. 
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Biacore relies on changes in the refractive index at the surface layer 
upon binding of a ligand to a protein immobilized on the layer. In this 
system, a collection of small ligands is injected sequentially in a 2-5 
microliter cell, wherein the protein is immobilized within the cell. Binding is 
5 detected by surface plasmon resonance (SPR) by recording laser light 
refracting from the surface. In general, the refractive index change for a 
given change of mass concentration at the surface layer is practically the 
same for all proteins and peptides, allowing a single method to be applicable 
for any protein (Liedberg et al. (1983) Sensors Actuators 4:299-304; 

10 Malmquist (1993) Nature 361:186-187). In atypical experiment, the target to 
be analyzed is expressed as described for FCS. The purified protein is then 
used in the assay without further preparation. It is bound to the Biacore chip 
either by utilizing the poly-histidine tag or by other interaction such as ion 
exchange or hydrophobic interaction. The chip thus prepared is then 

15 exposed to the potential ligand via the delivery system incorporated in the 
instruments sold by Biacore (Uppsala, Sweden) to pipet the ligands in a 
sequential manner (autosampler). The SPR signal on the chip is recorded 
and changes in the refractive index indicate an interaction between the 
immobilized target and the ligand. Analysis of the signal kinetics of on rate 

20 and off rate allows the discrimination between non-specific and specific 
interaction. 

I.F. Transgenic animals 

It is also within the scope of the present invention to prepare a 
transgenic animal to mutagenize the MS4A locus or to express a transgene 

25 comprising nucleic acid sequences of the present invention. The term 
"transgenic animal", indicates an animal comprising a germline insertion of a 
heterologous nucleic acid. Transgenic animals of the present invention are 
understood to encompass not only the end product of a transformation 
method, but also transgenic progeny thereof. 

30 The term "transgene", as used herein indicates a heterologous 

nucleic acid molecule that has been transformed into a host cell. For 
intended use in the creation of a transgenic animal, the transgene includes 
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genomic sequences of the host organism at a selected locus or site of 
transgene integration to mediate a homologous recombination event. A 
transgene further comprises nucleic acid sequences of interest, for example 
a targeted modification of the gene residing within the locus, a reporter gene, 
5 or a expression cassette, each defined herein above. 

Transgene integration can be used to create gene mutations, 
including "knock-out", "knock-in", or a "knock-down" mutations. 
Representative approaches are disclosed in the Examples presented below. 
The term "knock-out" refers to a homologous recombination event that 

10 renders a gene inactive. Gene knock-out is generally accomplished by 
integration of the transgene at a chromosomal loci, thereby interrupting a 
gene residing at that loci. The term "knock-in" refers to in vivo replacement 
at a targeted locus. Knock-in mutations can modify a gene sequence to 
create a loss-of-f unction or gain-of-function mutation. The term "gene 

15 knock-down" refers to a homologous recombination event wherein the 
transgene partially eliminates gene function. A knock-down animal can be 
created by transgenic expression of an antisense molecule, wherein a 
transgene comprising the antisense sequence and a relevant promoter are 
integrated into the genome at a non-essential loci. Expression of the 

20 antisense or ribozyme molecule disrupts the corresponding gene function, 
although this disruption is generally incomplete (Luyckx et al. (1999) Proc 
Natl Acad Sci U S A 96(2 1 ): 1 2 1 74-1 2 1 79). 

Conditional mutation can be accomplished using transgenic methods 
in combination with the Cre-recombinase system in mice. Briefly, in one 

25 instance, a transgenic mouse is derived that expresses Cre-recombinase 
under the direction of an inducible promoter. A second transgenic mouse 
bears a mutation of a gene of interest as well as a lox-P-flanked endogenous 
gene sequence. Such transgenic mice are mated, the resulting progeny 
having both the Cre-recombinase and lox-P-flanked transgenes. Induction 

30 of Cre recombinase catalyzes excision of the lox-P-flanked transgene, 
thereby excising a portion of the endogenous gene sequence and revealing 
the mutated sequence. Conditional knockout can be varied according to the 
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temporal and spatial features of Cre recombinase expression, inherent in the 
selection of a promoter to drive Cre recombinase. See Postic et al. (1999) J 
Biol CA7e/77 275(1):305-315; and Sauer (1998) Methods 14(4):381-392. 

Transgenes can also be used for heterologous expression in a host 
5 organism without generating phenotypically apparent mutations. By this 
method, nucleotide sequences of interest are introduced into the genome at 
a nonessential loci, whereby insertion alone does not disrupt an essential 
gene function. Optionally, expression of the transgene can generate a gain- 
of-function or ectopic function phenotype. 

10 Techniques for the preparation of transgenic animals are known in the 

art. Exemplary techniques are described in U.S. Patent No. 5,489,742 
(transgenic rats); U.S. Patent Nos. 4,736,866, 5,550,316, 5,614,396, 
5,625,125 and 5,648,061 (transgenic mice); U.S. Patent No. 5,573,933 
(transgenic pigs); 5,162,215 (transgenic avian species) and U.S. Patent No. 

15 5,741,957 (transgenic bovine species). Briefly, nucleotide sequences of 
interest are cloned into a vector, and the construct is transformed into a 
germ cell. In the germ cell, a chromosomal rearrangement event takes 
place wherein the nucleic acid sequences of interest are integrated into the 
genome of the germ cell by homologous recombination. Fertilization and 

20 propagation of the transformed germ cell results in a transgenic animal. 
Homozygosity of the mutation is accomplished by intercrossing. 
I.G. Therapeutic Methods 

The present invention further provides methods for discovering 
substances that can be used as pharmaceutical compositions. The term 
25 "pharmaceutical composition" or "drug" as used herein, each refer to any 
substance having a biological activity. Substances discovered by methods 
of the present invention include but are not limited to polypeptide, proteins, 
peptides, chemical compounds, and antibodies. 

A composition of the present invention is typically formulated using 
30 acceptable vehicles, adjuvants, and carriers as desired. 

Among the acceptable vehicles and solvents that can be employed 
are water, Ringer's solution, and isotonic sodium chloride solution. In 
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addition, sterile, fixed oils are conventionally employed as a solvent or 
suspending medium. For this purpose any bland fixed oil can be employed 
including synthetic mono- or di-glycerides. In addition, fatty acids such as 
oleic acid find use in the preparation of injectable compositions. 
5 Injectable preparations, for example sterile injectable aqueous or 

oleaginous suspensions, are formulated according to the known art using 
suitable dispersing or wetting agents and suspending agents. The sterile 
injectable preparation can also be a sterile injectable solution or suspension 
in a nontoxic diluent or solvent, for example 1 ,3-butanediol. 

10 A vector can be used as a carrier, for example an adenovirus vector, 

can be used for gene therapy methods. The vector is purified to sufficiently 
render it essentially free of undesirable contaminants, such as defective 
interfering adenovirus particles or endotoxins and other pyrogens such that it 
does not cause any untoward reactions in the individual receiving the vector 

15 construct. A preferred means of purifying the vector involves the use of 
buoyant density gradients, such as cesium chloride gradient centrifugation. 

A transfected cell can also serve as a carrier. By way of example, a 
liver cell can be removed from an organism, transfected with a nucleic acid 
sequence of the present invention using methods set forth above and then 

20 the transfected cell returned to the organism (e.g. injected intra-vascularly). 

Monoclonal antibodies or polypeptides of the invention can be 
administered parenterally by injection or by gradual infusion over time. 
Although the tissue to be treated can typically be accessed in the body by 
systemic administration and therefore most often treated by intravenous 

25 administration of therapeutic compositions, other tissues and delivery means 
are provided where there is a likelihood that the tissue targeted contains the 
target molecule and are known to those of skill in the art. 

Representative antibodies for use in the present invention are intact 
immunoglobulin molecules, substantially intact immunoglobulin molecules, 

30 single chain immunoglobulins or antibodies, those portions of an 
immunoglobulin molecule that contain the paratope, including antibody 
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fragments. It is within the scope of the present invention that a monovalent 
modulator can optionally be used. 

Methods of preparing "humanized" antibodies are generally well 
known in the art, and can readily be applied to the antibodies of the present 
5 invention. Humanized monoclonal antibodies offer particular advantages 
over monoclonal antibodies derived from other mammals, particularly insofar 
as they can be used therapeutically in humans. Specifically, humanized 
antibodies are not cleared from the circulation as rapidly as "foreign" 
antigens, and do not activate the immune system in the same manner as 

10 foreign antigens and foreign antibodies. 

With respect to the therapeutic methods of the present invention, a 
preferred subject is a vertebrate subject. A preferred vertebrate is warm- 
blooded; a preferred warm-blooded vertebrate is a mammal. A preferred 
mammal is a mouse or, most preferably, a human. As used herein and in 

15 the claims, the term "patient" includes both human and animal patients. 
Thus, veterinary therapeutic uses are provided in accordance with the 
present invention. 

Also provided is the treatment of mammals such as humans, as well 
as those mammals of importance due to being endangered, such as 

20 Siberian tigers; of economical importance, such as animals raised on farms 
for consumption by humans; and/or animals of social importance to humans, 
such as animals kept as pets or in zoos. Examples of such animals include 
but are not limited to: carnivores such as cats and dogs; swine, including 
pigs, hogs, and wild boars; ruminants and/or ungulates such as cattle, oxen, 

25 sheep, giraffes, deer, goats, bison, and camels; and horses. Also provided 
is the treatment of birds, including the treatment of those kinds of birds that 
are endangered and/or kept in zoos, as well as fowl, and more particularly 
domesticated fowl, i.e., poultry, such as turkeys, chickens, ducks, geese, 
guinea fowl, and the like, as they are also of economical importance to 

30 humans. Thus, provided is the treatment of livestock, including, but not 
limited to, domesticated swine, ruminants, ungulates, horses, poultry, and 
the like. 
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As used herein, the term "experimental subject" refers to any subject 
or sample in which the desired measurement is unknown. The term "control 
subject" refers to any subject or sample in which a desired measure is 
unknown. 

5 As used herein, an "effective" dose refers to a dose(s) administered to 

an individual patient sufficient to cause a change in MS4A activity. One of 
ordinary skill in the art can tailor the dosages to an individual patient, taking 
into account the particular formulation and method of administration to be 
used with the composition as well as patient height, weight, severity of 

10 symptoms, and stage of the biological condition to be treated. Such 
adjustments or variations, as well as evaluation of when and how to make 
such adjustments or variations, are well known to those of ordinary skill in 
the art of medicine. 

A therapeutically effective amount can comprise a range of amounts. 

15 One skilled in the art can readily assess the potency and efficacy of a MS4A 
modulator of this invention and adjust the therapeutic regimen accordingly. 
A modulator of MS4A biological activity can be evaluated by a variety of 
means including the use of a responsive reporter gene, interaction of MS4A 
polypeptides with a monoclonal antibody, analysis of cell subpopulations, 

20 and measurement of [Ca 2+ ]j levels, each technique described herein. 

Additional formulation and dose techniques have been described in 
the art, see for example, those described in U.S. Patent Nos. 5,326,902 and 
5,234,933, and International Publication No. WO 93/25521. 

For the purposes described above, the identified substances can 

25 normally be administered systemically, parenterally, or orally. The term 
"parenteral" as used herein includes intravenous, intra-muscular, intra- 
arterial injection, or infusion techniques. Other compositions for 
administration include liquids for external use, and endermic liniments 
(ointment, etc.), suppositories, and pessaries which comprise one or more of 

30 the active substance(s) and can be prepared by known methods. 
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|L CD20 Gene Family Members 

II.A. Identification of CD20 Gene Family Members 
The present invention provides MS4A nucleic acid and polypeptide 
sequences. Preferably, a MS4A gene comprises the sequence set forth as 
5 any one of the odd-numbered SEQ ID NOs:1-37, a nucleic acid molecule 
that is substantially similar to any one of the odd-numbered SEQ ID NOs:1- 
37, or a nucleic acid molecule comprising a 20 base pair nucleotide 
sequence that is identical to a contiguous 20 base pair sequence of any one 
of the odd-numbered SEQ ID NOs:1-37. 

10 To identify new CD20 gene family members, the human and mouse 

CD20 amino acid sequences (Tedder et al., 1988a; Tedder et al., 1988b) 
were used to search the translated GenBank databases, including 
expressed sequence tags, using the BLAST program (Altschul et al., 1997). 
Among 337 homologous sequences identified, at least 17 novel genes 

15 expressed by mouse, human, and pig had predicted amino acid sequences 
homologous to CD20. Complete coding regions were predicted using 
overlapping nucleotide sequences obtained from sequenced ESTs and 
cDNAs that corresponded to unique, near full-length transcripts in humans 
and mice (Figure 1). All nucleotide sequences were verified by sequencing 

20 multiple near full-length cDNAs isolated by applicants and 40 cDNAs 
obtained from the ATCC (American Tissue Culture Collection, Bethesda, 
Maryland, USA). In addition, a pig cDNA and its human counterpart 
homologous to CD20 were identified as GenBank submissions AJ236932.1 
and AK000224, respectively. In total, unique cDNA clones were identified 

25 that encode at least 16 distinct full-length CD20-like proteins. 

Complete cDNA sequences encoding the human and mouse MS4A 
family members (MS4A1, -A2, -A3, -A4A, -A5, -A6A, -A7, -A8B and -A12) 
were also used to search the GenBank human genomic database (htgs; 
http://www.ncbi.nlm.nih.gov/blast/) using the BLAST program (Altschul et al., 

30 1997), as further described in Example 2. Two-hundred-twenty different 
contigs or distinct genomic DNA sequences were identified in the database 
of unfinished human genomic sequences that were either identical or similar 
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to MS4A family members. These sequences were predominantly derived 
from sixteen partially sequenced bacterial artificial chromosomes (BACs) 
that spanned 400-500 kb of human chromosome 1 1q12 (Table 1). Based on 
known cDNA sequences of MS4A family members, we were able to order 
5 and arrange these genomic sequences into overlapping continuous DNA 
segments. Since many of the contigs identified were overlapping, it was 
thereby possible to assemble long DNA sequences that encoded entire 
MS4A genes or portions of MS4A genes. Gaps between exon encoding 
DNA sequences were filled in many cases by additional sequence homology 

10 searches using DNA sequences found at the ends of gaps. When sequence 
differences were observed between different overlapping DNA fragments, 
consensus sequences were used or PGR primers were generated, that 
portion of genomic DNA was then amplified and sequenced to resolve 
ambiguous sequences. 

15 BLAST analysis of the htgs phase 1 or phase 2 human genomic DNA 

sequences encoding MS4A cDNAs and the assembled and annotated 
human genomic sequence thereof, as disclosed herein, revealed the 
presence of each known human MS4A family member. In addition, three 
putative genes encoding unique MS4A family members were identified that 

20 localized to the q1 2-1 3.1 region of human chromosome 11. Complete 
coding regions were predicted using overlapping nucleotide sequences 
obtained from sequenced ESTs and cDNAs and by comparison of gene 
structure, described further herein below (Figure 2). 

By identifying sequences that correlated with different MS4A genes in 

25 each BAG (Table 1), and by the assembly of minimal genomic DNA lengths 
that could encode each MS4A gene (Figure 2), we used the overlapping 
BACs to identify the order of the MS4A genes on chromosome 11q12 
(Figure 6). This analysis also allowed us to determine the direction of gene 
transcription for most MS4A genes. Furthermore, the MS4A cDNA 

30 sequences, disclosed herein, were used to assemble genomic clones set 
forth as SEQ ID NOs:73-81. In some cases, multiple MS4A genes could be 
aligned within a continuous genomic sequence. For example, the genomic 
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sequence set forth as SEQ ID NO:77 comprises both the MS4A4E and 
MS4A6A genes. Similarly, the genomic region set forth as SEQ ID NO:79 
comprises three MS4A genes: MS4A 7, MS4A5, and MS4A12. 

The MS4A4E gene encodes 660 bp of translated sequence (Figure 
5 3), contained within at least seven exons (Figure 2). Exons were identified 
based on their sequence similarities with MS4A4A sequences and the 
identification of canonical splice-donor and -acceptor sites (Aebi & 
Weissmann, 1987). The MS4A4E gene sequence was at least 23,379 base 
pairs in length, if counted from the putative translation initiation ATG site until 

10 the TGA translation termination stop site (Figure 2). An exon encoding the 
putative 5' untranslated region of MS4A4E, was highly homologous with the 
corresponding sequence in MS4A4A cDNAs (disclosed herein). This 
sequence homology extended for >7 kbp upstream from this putative exon 
and also included upstream repetitive Alu elements. Representative 

15 upstream homologous sequences are shown in Figure 3. Similar sequence 
homologies were identified in the 3' untranslated regions of MS4A4E and 
MS4A4A, which extended beyond the poly-A attachment signal sequences 
(Figure 3). Based on the sequence similarities in translated and 
untranslated exons, it appears that the MS4A4E and MS4A4A genes 

20 resulted from a recent gene duplication event. 

The MS4A6E gene encodes 441 bp of translated sequence (Figure 
4), contained within at least four exons (Figure 2). Exons were identified 
based on their sequence similarities with MS4A6A cDNA sequences and the 
identification of canonical splice-donor and -acceptor sites (Aebi & 

25 Weissmann, 1987). In addition, the predicted gene sequences matched 
those found in three cDNA clones that were sequenced (ATCC Nos. 
3704466, 1852248 and 3557769). The MS4A6E gene was at least 5,060 bp 
in length, if counted from the putative translation initiation ATG site until the 
TGA translation termination codon (Figure 2). The MS4A6E gene lacks 

30 exons that encode the first two membrane spanning domains present in 
most MS4A family proteins (Figures 2 and 7). An exon homologous with the 
5' untranslated region of MS4A6A cDNAs was not identified within 7,629 bp 
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of sequence upstream of the exon encoding the translation initiation site of 
MS4A6E. However, there was a canonical 3' splice region upstream of the 
ATG initiation codon located at identical positions in the MS4A6E and 
MS4A6A genes. Similar sequence homologies were identified in the 3' 
5 untranslated regions of MS4A6E and MS4A6A that extend beyond the 
sequence shown in Figure 4. Based on the sequence similarities in 
translated and untranslated exons, it appears that the MS4A6E and 
MS4A6A genes represent a recent gene duplication event, although several 
exons encoding translated sequence were lost in the MS4A6E gene (Figure 
10 2). 

The MS4A10 gene encodes 726 bp of translated sequence (Figure 
5), contained within at least six exons (Figure 2). Exons were identified 
based on their sequence similarities with mouse MS4a10 cDNA sequences 
and the identification of canonical splice-donor and -acceptor sites (Aebi & 

15 Weissmann, 1987). The MS4A10 gene was at least 8,183 bp in length if 
counted from the putative translation initiation ATG site until the TGA 
translation termination stop site (Figure 2). An exon homologous with the 5' 
untranslated region of mouse MS4a10 cDNAs was not identified within 8,829 
bp of sequence upstream of the exon encoding the translation initiation site 

20 of MS4A10. However, there was a canonical 3* splice region upstream of 
the ATG initiation codon located at identical positions in the MS4A10 and 
MS4a10 genes. Modest sequence homologies were identified in the 3 1 
untranslated regions of MS4A10 and MS4a10 (Figure 5). 



WO 02/062946 



44 



PCT/US01/48437 



Table 1 

Human BACs Containing MS4A Genes 



RAP 


Accession No. a 


Chromosome 


MS4A gene b 


RP1 1-206B10 


AC009703 


15 


A4A, A4E, A6A 


RP1 1-21B14 


ACO 13807 


unknown 


A6A, A2, A3 


RP1 1-24D1 


ACO 15840 


unknown 


A4A, A5, A6E, A7 


RP1 1-652L5 


ACO 18966 


11 


A4A, A4E, A6A 


RP1 1-448N3 


AC024066 


11 


A8B 


RP1 1-312N17 


AC027599 


11 


A8B, A10 


RP11-196E16 

1 M It 1 \s >■/ l— • v 


AC027787 


15 


A5, A1 


CMR9-79B2 


AP000748 


11q23 


A10 


RP1 1-804A23 


AP000777 


11 


A10 


RP1 1-736110 


AP000790 


11q12 


A3 


RP11-804B24 


AP000934 


A A 

1 1 


AHA 


RP11-729B4 


AP001034 


11q12 


A5, A12, A1 


CMB9-2M23 


AP001181 


11q12 


A2, A3 


CMB9-100I1 


AP001257 


11q12 


A6A, A4E 


CMB9-49F18 


AP001259 


11 


A8B 


RP11-68H20 


AP001986 


11q 


A10 



a GenBank Accession number for the indicated BAC. 
5 "indicates the MS4A gene sequences that mapped to each BAC. 
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ll.B. MS4A Nomenclature 

In collaboration with the Human Gene Nomenclature Committee 
( www.gene.ucl.ac.uk/nomenclature/) , this gene family was designated as the 
MS4A family (Membrane Spanning 4-domain family, subfamily A). The MS4 
5 designation is to accommodate the future identification of genes encoding 
proteins with a similar structure, yet with unresolved functions. Subfamily A 
will designate the CD20 family. Using this nomenclature, the CD20 gene 
was designated as MS4A 1 , FceR\f5 as MS4A2, and HTm4 as MS4A3. 
Among the 16 novel genes identified, 8 human genes were named MSA AAA, 

10 MS4A4E, MS4A5, MS4A6A, MS4A6E, MS4A 7, MS4A8B, and MS4A12. A 
ninth gene encoded a protein homologous with the single member of the 
mouse MS4a10 subfamily. This gene was tentatively designated as 
MS4A10. The remaining genes were of mouse or pig origin and were 
therefore labeled as MS4a3-MS4a12 based on the nomenclature of 

15 homologous genes corresponding to human counterparts. Distinct mouse 
genes that encoded proteins with highly homologous sequences were 
designated as MS4a4B, MS4a4C, MS4a4D, and as MS4a6B, MS4a6C, and 
MS4a6D to signify close homology. 

II. C. MS4A Gene Chromosome Locations 

20 Chromosome locations for the human MS4A4A, MS4A6A, MS4A7, 

and MS4A8B genes were identified in two distinct homology searches. 
Regions of human MS4A4A (bp 1286-1588), MS4A6A (bp 682-1106), 
MS4A7 (bp 502-941), MS4A7 (bp 1015-1177), and MS4A8B (bp 1007- 
1350), were 98%, 98%, 97%, 99% and 97% identical with human STS 

25 genomic sequence tag sites, WI-11578, SHGC-36634, WI-12101, WIAF- 
3856, and WI-14145, respectively ( http://www.ncbi.nlm.nih.gov/blast) . 
These genomic sequence tag sites are located on human chromosome 1 1 at 
Genomic Database locus D1 1S1357-D1 1S913, which maps to 11q12-13 
( http://www.ncbi.nlm.nih.gov/genemap) . These mapping results were 

30 confirmed using the UniGene collection at the National Center for 
Biotechnology Information ( http://www.ncbi.nlm.nih.gov/Genemap98/) for 
expressed sequence tags identical to human MS4A4A, MS4A6A, MS4A7, 
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MS4A8B sequences. By this analysis, at least 7 of the 9 currently identified 
human MS4A genes are clustered. 

The organization of the 12 MS4A genes on human chromosome 11 
was determined by identifying sequenced human genomic DNA fragments 
5 (contigs of different lengths) from 15 BAC clones (Table 1). Contiguous 
DNA segments for each BAC were constructed based on human MS4A 
exon and cDNA sequences, and overlapping contigs. Although some gaps 
were present in MS4A gene introns (Figure 2) or between MS4A genes, the 
relative position of each gene on chromosome 1 1q1 2-1 3.1 was determined 

10 (Figure 6). MS4A1 was located in a telemetric region of 1 1 q1 2-1 3.1 
compared with MS4A2 and MS4A3. Seven MS4A genes were located in 
between MS4A1 and MS4A2. Two other MS4A genes, MS4A8B and 
MS4A10 were centromeric to MS4A2 and MS4A3, although the distance 
between these genes was not determined. Interestingly, MS4A6A, 

15 MS4A4E, MS4A4A and MS4A6E were arranged linearly suggesting that 
these genes might have arisen through the duplication of a single genomic 
element. It is envisioned that this genetic locus extends further and contains 
additional MS4A genes. 

II.A. MS4A Gene Structure 

20 Complete coding region sequences were verified for each deduced 

protein, except for the MS4a3 cDNA that was not full-length (Figure 1). 
Proposed ATG translation initiation codons were based on the translation 
initiation consensus sequence, ANN ATG (Kozak (1986) Cell 44:283-292), 
and the existence of in-frame upstream translation stop codons in most 

25 cases. Whether the first or second ATG codon in mouse MS4a8B was used 
for translation initiation was unknown although the second ATG was identical 
with the start codon of human MS4A8B (Figure 7). 

Poly(A) attachment signal sequences were identified in the proximal 
3 f untranslated regions of each gene product except MS4A6A, MS4A6E, 

30 MS4A10, and MS4a6C. Two poly(A) signal sequences were found in 
MS4a4D, MS4A5, and MS4a10 transcripts, while four were observed in 
MS4A4A transcripts. 
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The disclosed MS4A cDNAs were further used to annotate the 
genomic sequence derived from BAC clones. Annotated features include 
definition of coding regions, intronjexon junctions, sequences upstream of 
the initial coding region of each gene that comprise the promoter region, and 
5 other adjacent sequences that could also comprise gene regulatory 
elements. Representative methods for further characterizing a MS4A 
promoter region are disclosed in Example 9. 

Annotation of human MS4A genomic regions (SEQ ID NOs:73-81), as 
disclosed herein, enabled a comparison of gene structure among MS4A 

10 genes. The overall domain organization of each MS4A gene was similar 
(Figures 2 and 7). All exon|intron|exon boundaries were consistent with 
consensus splice-donor and -acceptor sequences unless otherwise 
indicated, with exon|GTGAGT-intron-CAG|exon sequences in most cases 
(Aebi & Weissmann, 1987). In addition, the splice junctions for all translated 

15 exons were located after the third nucleotide in each codon. Most MS4A 
proteins were encoded by 6 exons except MS4A2, MS4A5, and MS4A6E 
(Figure 2 and 7). In these exceptions: the N-terminal cytoplasmic domain of 
MS4A2 was encoded by two exons (Kuster et al., 1992); the MS4A5 and 
MS4A6E genes did not encode C-terminal cytoplasmic domains; and the 

20 MS4A6E gene had only two membrane spanning domains. Intron lengths 
demonstrated wide variation from 181 bp in MS4A12 to 13,731 bp in 
MS4A5. In some cases however, exact intron lengths were not determined; 
MS4A3, MS4A4, and MS4A12 (Figure 2). Distances between translation 
initiation and termination codons were determined for most MS4A genes; 

25 with MS4A6E being the smallest (5,060 bp) and MS4A4E being the longest 
(23,379 bp) genes (Figure 6). Thus, the intron|exon organization of all 
MS4A family members is consistent with the high degree of conservation 
within this gene family. 

There were no amino-terminal signal sequences, although all MS4A 

30 proteins contained hydrophobic regions of sufficient length to pass through 
the membrane at least four times. Notable was a marked clustering of 
charged residues at both ends of the putative transmembrane domains, 
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some of which were highly conserved. In some cases, the first and second 
putative transmembrane domains of MS4A proteins were a continuous 
stretch of hydrophobic amino acids without an obvious inter-transmembrane 
hydrophilic bridge. By contrast, MS4A4A and MS4A7 had 6 to 7 hydrophilic 
5 amino acids inserted between the first and second hydrophobic domains. In 
human MS4A4A and mouse MS4a4B, MS4a4C, and MS4a4D, an extensive 
hydrophobic region followed the fourth putative membrane-spanning 
domain. Thus, the overall structure of MS4A family members was well 
conserved. 

10 NjL MS4A Gene Splice Variants 

Among the MS4A cDNAs sequenced and EST sequences analyzed, 
multiple splice variants were identified that encoded variant MS4A proteins. 
In most cases, exons were spliced out, which generated truncated protein 
products. Potential splice variants of the MS4A4A, MS4A5, MS4A6A, and 

15 MS4A7 genes were identified. Whether these alternatively spliced variants 
produce functional proteins has yet to be determined. 

Two splice variations of the MS4A4A gene were identified during an 
analysis of MS4A4A mRNA expression by lymphoblastoid cell lines. Most of 
the hematopoietic cell lines examined expressed transcripts encoding a full- 

20 length MS4A4A protein as shown in Figure 7. However, a second smaller 
transcript was also expressed in most cases that contained a potential exon 
deletion of 158 nucleotides. This was a frequent event since 40% of 
MS4A4A cDNAs generated from the BJAB B cell line encoded the truncated 
protein. In addition, the same splicing event was observed in two of five 

25 EST sequences that covered this region of the MS4A4A protein. Splicing- 
out this potential exon deleted the third membrane-spanning domain and the 
second extracellular loop from the full-length protein (positions 110-163, 
Figure 3). Of interest, this splicing event fused the first/second membrane 
spanning domains with the fourth membrane spanning domain. However, 

30 the fourth transmembrane spanning domain in MS4A4A is followed by 
another hydrophobic region of sufficient length to traverse the membrane 
(disclosed herein). This suggests that differential splicing can generate an 
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alternative MS4A4A protein with four membrane spanning domains lacking a 
significant extracellular domain. 

In the case of the MS4A5 gene, two of nine MS4A5 EST sequences 
analyzed (GenBank Accession Nos. AA411806 and AA781801) encoded a 
5 splice variant that preserved the reading frame of the transcript. In both 
sequences, the exon encoding the third membrane-spanning domain and 
the second extracellular loop from the full-length protein (TM3, Figure 1) was 
spliced out using normal splice-donor and -acceptor sequences, which 
deleted 51 amino acids (114-164) from the full length protein (Figure 7). 

10 This deletion resulted in a protein with the first/second membrane spanning 
domains fused with the fourth predicted membrane-spanning domain. Thus, 
the truncated MS4A5 protein would possess three membrane-spanning 
domains with an extracellular carboxyl-terminal domain. 

A novel splicing event was observed in the MS4A6A gene which 

15 resulted in a truncated protein. A novel splice donor site (CAG T 683 |GT GAG 
T) is located within the exon encoding the TM3/extracellular loop domains 
(Figure 4). This cryptic splice donor site was spliced with the normal 3' 
splice acceptor site of the exon encoding the TM4 domain, which thereby 
deletes nucleotides 684-787 from MS4A6A transcripts (Figure 4). Since 

20 there was an extra T introduced into the codon sequence due to this 
alternative splicing event, there was a frameshift in the coding sequence. 
This potentially results in the attachment of a novel 30 amino acid sequence 
(-WNSLSDADLHSAGILPSCAHCCAAVETGLL) that is not predicted to be 
hydrophobic. Thus, the variant MS4A protein would be 70 amino acids 

25 shorter and would lack the fourth membrane-spanning and cytoplasmic 
domains. This alternative splicing event was found in 3 of 29 EST 
sequences that encoded this region (GenBank Accession Nos. AI278475, 
AA461046, and AA448335) and in one cDNA clone (GenBank Accession 
No. AB013104). 

30 Splice variation in MS4A7A transcripts produces two distinct protein 

products in addition to the presumably normal protein. In one case, a splice 
variation in MS4A7A transcripts produces a protein product similar in 
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structure to the MS4A6E protein. The exon encoding the first/second 
membrane spanning domains (amino acids 50-94, Figure 7) was deleted in 
2 of 4 MS4A7 EST sequences analyzed (GenBank Accession Nos. N42191 
and R1 1 179) that cover this region. Thus, the protein product would have a 
5 longer N-terminal cytoplasmic domain and only two membrane spanning 
domains. In the second case, the exon encoding the fourth membrane- 
spanning domain (amino acids 183-216) was deleted in 2 EST sequences 
(GenBank Accession Nos. R11180 and AM 88478) out of 18 sequences 
analyzed (Figure 7). 

10 II.F. MS4A Gene Polymorphisms 

Putative polymorphisms were identified in the MS4A6A gene. Two 
nucleotide substitutions were found in cDNA clone ATCC No. 499181 and in 
13 of 38 EST sequences analyzed (Figure 1). The first substitution was at 
nucleotide 373 that exchanged a C for a T, which did not alter the amino 

15 acid sequence. The second substitution resulted in a Ser in place of Thr at 
amino acid 185. In addition, a third substitution was found in 4 of the 38 
EST sequences analyzed where a Ser was substituted in place of an Ala at 
amino acid position 183. This substitution was paired with a Ser to Thr 
substitution at amino acid position 185 in half of the clones analyzed. These 

20 differences most likely represent common sequence polymorphisms since 
they were observed in multiple independent cDNA clones. Based on our 
genetic DNA analysis, it is unlikely that these differences could represent 
transcripts from distinct genes that are almost identical in coding sequence. 
As with the MS4A6A gene (disclosed herein), potential gene 

25 polymorphisms were observed in MS4A6E. Three cDNA clones 
representing partial transcripts were sequenced completely on both strands. 
The predicted MS4A6E gene product and one cDNA clone (ATCC No. 
3704466) had identical sequences. However, the ATCC No. 3557769 cDNA 
had a nucleotide substitution at position 314 (Figure 4) that exchanged a T 

30 for a C, which did not alter the predicted amino acid sequence. The ATCC 
No. 1852248 cDNA clone had the longest insert that starts at nucleotide 
position 60 and ended at position 661 as shown in Figure 4. This cDNA had 
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a substitution at nucleotide 153 that exchanged a G for a T, which resulted in 
a Phe in place of Val at amino acid 47 (Figure 4). Therefore, sequence 
polymorphisms can exist within the MS4A6E gene. 

Other potential polymorphisms were observed in other MS4A family 
5 members based on consistent nucleotide variations found in MS4A4E 
sequences. 

The assembly and annotation of genomic sequences comprising 
MS4A genes in the region of human chromosome 11q12-13.1, disclosed 
herein for the first time, provide source material for identification of 

10 polymorphisms that are linked to MS4A genes. Such polymorphisms can 
include single nucleotide polymorphisms as disclosed within the MS4A6A 
and MS4A6E coding region sequences. In addition, polymorphisms within 
or genetically linked to MS4A genes can also comprise restriction length 
polymorphisms (RFLPs) (Lander & Botstein (1989) Genetics 121:185-199), 

15 short tandem repeat polymorphisms (STRPs), short sequence length 
polymorphisms (SSLPs) (Dietrich et al. (1996) Nature 380:149-152), 
amplified fragment length polymorphisms (AFLPs) (Latorra et al. (1994) PCR 
Methods Appl 3(6):351-358), and microsatellite markers (Schalkwyk et al. 
(1999) Genome Res 9:878-887). Identification of polymorphisms within an 

20 isolated DNA molecule are known to one of skill in the art. 

II. G. MS4A Proteins 

The MS4A genes encoded proteins of 16-29 kDa (Table 2). 
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Table 2 
MS4A Family Members 



Human 


Mouse 


Human/Mouse 


Name 


kua 


Name 


Kua 


nui i nji\jy y 






MS4a3 




63% (partial) 


MS4A4A 


23 












Ms4a4B 


24 


41% 






Ms4a4C 


24 


44% 






Ms4a4D 


24 


40% 


MS4A4E 


24 








MS4A5 


22 








MS4A6A 


27 












Ms4a6B 


27 


52% 






Ms4a6C 


24 


51% 






Ms4a6D 


26 


53% 


MS4A6E 


16 








MS4A7 


26 


MS4a7 


26 


53% 


MS4A8B 


26 


MS4a8B 


29 


63% 


MS4A10 


27 


MS4a10 


29 


52% 


MS4A12 


26 


MS4a12( P ig) 


26 


60% 



a Predicted molecular weights for the new MS4A family members and the 
5 percentage amino acid sequence identity between deduced MS4A and 
MS4a proteins. 
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Comparisons between CD20 and the predicted amino acid 
sequences for human MS4A4A, MS4A5, MS4A6A, MS4A7, MS4A8B, and 
MS4A12 revealed 23-29% amino acid sequence identity (Figure 7). The 
highest degree of identity was found in the first three transmembrane 
5 domains with multiple regions of conserved amino acids. In particular, the 
amino acid sequences LGAXQI (SEQ ID NO:57) and LSLG (SEQ ID NO:58) 
were common within the first transmembrane domain, GYPFWG (SEQ ID 
NO:60) and FIISGSLS (SEQ ID NO:61) were common in the second 
domain, and SLX2NX2SX3AX2G (SEQ ID NO:62) was found in the third 

10 transmembrane domain, the first and second transmembrane domains of 
MS4A8B were 46% identical in amino acid sequence with human CD20, 
41% identical with FceRip, and 39% identical with HTm4. The MS4A4A, 
MS4A5, MS4A6A, and MS4A7 proteins were most homologous in their first 
and second transmembrane domains with the human FcsRip chain, with 37- 

15 46% amino acid sequence identity. There was large variation between 
MS4A proteins in the N- and C-terminal cytoplasmic domains. However, 
Pro residues were significantly over-represented within the N- and C- 
terminal cytoplasmic domains of most MS4A family members. There was 
some sequence identity in the first potential extracellular loop that was -13 

20 amino acids in length for each protein. By contrast, the second predicted 
extracellular loop ranged from 10-46 amino acids in length with diverse 
sequences. 

The putative MS4A4E gene encodes a 220 amino acid protein of 23.8 
kDa with a predicted amino acid sequence that is 76% identical with the 

25 MS4A4A protein (Figure 3). Consistent with other MS4A proteins, the most 
significant homologies between MS4A4E and other MS4A family members 
were found in the membrane spanning domains (Figure 7). Common amino 
acid motifs were readily visualized such as KXLGAIQI (SEQ ID NO:57), 
GYPXWG (SEQ ID NO:60), and SGXLSI (SEQ ID NO:59) in the first and 

30 second hydrophobic regions that represent potential transmembrane 
regions. The intracellular N- and C-terminal domains were highly conserved 
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between MS4A4E and MS4A4A, but were divergent from other family 
members. 

The putative MS4A6E gene encodes a 147 amino acid protein of 15.9 
kDa with a predicted amino acid sequence that is 78% identical with the 
5 MS4A6A protein (Figure 4). The most significant homologies between 
MS4A6E and other MS4A family members were found in the membrane 
spanning domains, although MS4A6E only had two (TM3 and TM4) 
membrane-spanning domains (Figures 4 and 7). The putative second 
extracellular loops of MS4A6E and MS4A6A were of identical length (Figure 

10 4). Common amino acid motifs were readily visualized in the hydrophobic 
regions that represent potential transmembrane regions. The intracellular N- 
terminal domain was highly conserved between MS4A6E and MS4A6A, but 
were divergent from other family members. MS4A6E protein also lacks a C- 
terminal cytoplasmic domain (Figure 4). 

15 The putative MS4A10 gene encodes a translated 241 amino acid 

protein of 26.9 kDa with a predicted amino acid sequence that is 52% 
identical with the mouse MS4a10 protein (Figure 5). The most significant 
homologies between MS4A10 and MS4a10 were found in the membrane 
spanning domains and the putative second extracellular loop (Figure 5). 

20 Although the N-terminal cytoplasmic domains of MS4A10 and MS4a10 were 
of similar length, the intracellular N- and C-terminal domains had the lowest 
sequence homologies among domains. The cytoplasmic C-terminal domain 
was 28 amino acids shorter in MS4A10 than MS4a10. Nonetheless, based 
on the sequence similarities of translated regions, it appears that MS4A10 

25 and MS4a10 represent homologous genes that are more similar to one 
another than other MS4A family members. 

Ten novel mouse MS4A proteins were identified that shared 40-63% 
amino acid sequence identity with their potential human counterparts (Figure 
7, Table 2). For comparison, the mouse and human CD20 proteins are 74% 

30 identical in amino acid sequence (Tedder et al., 1988a). A single partial 
cDNA was identified that encoded the mouse homologue for HTm4 (MS4a3, 
Figure 7). The predicted amino terminus of the proposed MS4a3 protein 
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was 23 amino acids shorter than in the human protein, although their 
overlapping regions were 63% identical in amino acid sequence. In all 
cases, the transmembrane domains of the human and mouse MS4A 
proteins were the most well conserved regions. For example, the human 
5 MS4A8B protein was 78% identical in sequence to MS4a8B in the first 3 
transmembrane domains and 68% identical in domain 4. Additional MS4A 
genes are likely to be identified in humans and mice, including the mouse 
MS4A5 homologue. 

A UPGMA (unweighted pair group method using arithmetic averages) 
10 tree showing relatedness of deduced MS4A and MS4a protein sequences is 
depicted in Figure 8. 

III. Methods for Detecting a MS4A Nucleic Acid Molecule 

In another aspect of the invention, a method is provided for detecting 

15 a nucleic acid molecule that encodes a MS4A polypeptide. According to the 
method, a biological sample having nucleic acid material is procured and 
hybridized under stringent hybridization conditions to a MS4A nucleic acid 
molecule of the present invention. Such hybridization enables a nucleic acid 
molecule of the biological sample and the MS4A nucleic acid molecule to 

20 form a detectable duplex structure. Preferably, the MS4A nucleic acid 
molecule includes some or all nucleotides of any one of the odd-numbered 
SEQ ID NOs:1-37. Also preferably, the biological sample comprises human 
nucleic acid material. 

III.A. Expression of MS4A Family Members in Hematopoietic Cells 

25 Since CD20, FceRip, and HTm4 expression are restricted to 

hematopoietic tissues, MS4A gene transcription was assessed by PCR 
amplification of cDNA from eleven human hematopoietic cell lines. Like 
CD20, MS4A8B was only expressed by B cell lines (Table 3). MS4A5 was 
only expressed by a promonocyte cell line. MS4A6A transcripts were 

30 expressed by B cell, myelomonocytic, and erythroleukemia cell lines. 
MS4A4A mRNA was expressed by all cell lines examined, although the 
relative mRNA levels varied significantly. MS4A7 was expressed in most, 



WO 02/062946 



56 



PCT/US01/48437 



but not all of the cell lines tested. MS4A12 transcripts were not detected in 
these cell lines. Thus, most MS4A family members are likely to be 
expressed in hematopoietic tissues. 

ESTs encoding MS4A transcripts were isolated from a variety of 
5 different cDNA libraries. MS4A4A ESTs were from aorta, brain, breast, 
heart, kidney, lung, ovary, pancreas, placenta, prostate, stomach, testis, and 
uterine tissues. MS4A5 ESTs were only isolated from testis. MS4A6A ESTs 
were from aorta, brain, the central nervous system, colon, gall bladder, 
heart, kidney, lung, muscle, ovary, pancreas, placenta, prostate, skin, 

10 stomach, tonsil, uterus and embryonic tissues. MS4A7 ESTs were from 
lung, kidney, lymphocytes, mammary gland, placenta, spleen, testis, thymus, 
and uterine tissues. MS4A8B ESTs were from brain, lung, uterus and 
embryonic tissues. A single MS4A12 EST was isolated from colon. This 
demonstrates differential MS4A gene transcription among lymphoid and 

1 5 non-lymphoid tissues. 
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Since most of the MS4A genes are expressed by hematopoietic cells, 
MS4A4E, MS4A6E and MS4A10 transcription were assessed by RT-PCR 
amplification of cDNA from human hematopoietic cell lines and human 
tissues. Transcripts from eleven human hematopoietic cell lines were 
5 evaluated; one pre-B cell line (NALM-6), three B cell lines (BJAB, DAUDI, 
and SB), four T cell lines (HSB-2, HUT-78, JURKAT, and MOLT15), two 
myelomonocytic lines (HL60 and U937), and one erythroleukemia cell line 
(K562). In addition, transcripts from eight human tissues were evaluated; 
colon, ovary, peripheral blood leukocytes, prostate, small intestine, spleen, 

10 testes and thymus. However, MS4A4E, MS4A6E and MS4A10 transcripts 
were not detected in any of these cell lines or tissues. 

MS4A4E, MS4A6E, and MS4A10 sequences were also used to 
search the translated GenBank databases using the BLAST program 
(Altschul et al., 1997). Eleven EST sequences representing MS4A6E 

15 transcripts were found that represented nine cDNAs isolated from pooled 
fetal organ libraries (GenBank Accession Nos. AA382998, AA909515, 
AA917066, AI222355, AI279944, AI684553, AI699419, AI743473, 
AI806247), one cDNA from a pooled germ cell tumor library (GenBank 
Accession No. AI968835), and one cDNA from a colon tumor (GenBank 

20 Accession No. AW951636). EST cDNAs encoding MS4A4E or MS4A10 
sequences were not identified. This suggests that MS4A4E, MS4A6E, and 
MS4A10 transcripts are rare among normal tissues or they are primarily 
expressed during oncogenesis or embryogenesis. 

MS4a gene expression by mouse tissues was assessed by Northern 

25 analysis and PCR amplification of cDNAs (Table 4). In most cases 
assessed, Northern analysis failed to detect specific MS4a transcripts in 
tissues that revealed transcript production by PCR amplification. These 
results suggest that MS4a transcripts are only produced by subpopulations 
of cells within each tissue such that transcript levels were often below the 

30 level of detection by Northern analysis. Nonetheless, MS4a4B, MS4a4C, 
and MS4a6B transcripts were found at high levels in thymus, spleen and 
peripheral lymph nodes, with less abundant levels in non-iymphoid tissues. 
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MS4a6C was only expressed by thymus, spleen, PLN and bone marrow. 
MS4a4C, MS4a6D and MS4a7 were expressed in all tissues examined. 
MS4a8B transcripts were expressed by spleen, peripheral lymph nodes, 
colon, liver, heart, lung and bone marrow. MS4a 10 transcripts were found in 
5 thymus, kidney, colon, brain, and testis. In addition, CD20 (MS4a1), FceRip 
(MS4a2), and MS4a3 expression were primarily restricted to hematopoietic 
tissues. MS4a3, MS4a4B, MS4a4C, MS4a6B, MS4a6C, MS4a6D, MS4a7, 
MS4a8B, and MS4a10 were also expressed by various hematopoietic and 
lymphoblastoid cell lines. Therefore, most MS4a family members were 
10 expressed by hematopoietic cells. 

Table 4 

MS4a Gene Expression by Mouse Tissues 3 



MS4a 


Thymus 


Spleen 


PLN 


BM 


Liver 


Kidney 


Heart 


Colon 


Lung 


Brain 


Teste 


1 


+ 


+++ 


4-4-4- 


+ 










+ 






2 


+ 


+ 


+ 


+++ 




+ 






+ 






3 


+ 


+ 


+ 


4-4-4- 










+ 


+ 




4B 


+++ 


+++ 


+++ 


++ 


+ 


+ 


+ 


+ 


+ 






4C 


4-4-4- 


+++ 


4-4-4- 


+++ 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


4D 


+ 


+ 


++ 




+ 


+ 


++ 


++ 


++ 




+ 


6B 


4-4-4- 


+++ 


+++ 


++ 


+ 




+ 


+ 


+ 




++ 


6C 


+ 


+ 


+ 


++ 
















6D 


+++ 


+++ 


+++ 


++ 


+++ 


+++ 


+++ 


+++ 


+++ 


+++ 


++4 


7 


++ 


++ 


++ 


++ 


+ 


+ 


+ 


++ 


+ 


+ 


+ 


8B 




+ 


+ 


+ 


+ 




+ 


++ 


+ 






10 


+ 










+ 




+ 




+ 


++ 


G3PDH 


4-4-4- 


+++ 


+++ 


+++ 


+++ 


+++ 


+++ 


+++ 


+++ 


+++ 


+ + 4 



a Gene transcription was assessed by PCR amplification of cDNA generated 
from mRNA isolated from tissue samples. Values represent the level of 
15 PCR product generated relative to the glyceraldehyde-3-phosphate 
dehydrogenase (G3PDH) control as described for Table 3. Peripheral lymph 
node (PLN) and bone marrow (BM). 
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Expression of MS4A family members was also assessed in mouse 
hematopoietic cell lines (Table 5). Nine of the twelve MS4A genes were 
expressed in pre-B cell lines and five of the MS4A genes were expressed in 
B cell lines. Six of the MS4A genes were expressed by T cell lines. These 
5 data suggest that B cells can express most members of the MS4A gene 
family, although the patterns of expression of each gene is distinct. 

Table 5 

MS4a Expression by Mouse Lymphoid Tissues and Cell Lines 3 

Tissues Pre B cell lines B cell lines T cell lines 
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10 



a Gene transcription was assessed by PCR amplification of cDNA generated 
from mRNA isolated from each cell type. Values represent the level of PCR 
product generated relative to the glyceraldehyde-3-phosphate 
dehydrogenase (G3PDH) control in three separate PCR reactions: no 
15 specific PCR product detected; +, low levels of the appropriate band were 
detectable; ++ to +++, appropriate bands of increasing intensity were readily 
visualized in all samples examined. Identical results were obtained using 
two different primer pairs for cDNA amplification. 
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MLB. Detection of MS4A Polymorphisms 

In another embodiment, genetic assays based on nucleic acid 
molecules of the present invention can be used to screen for genetic 
variants by a number of PCR-based techniques, including single-strand 
5 conformation polymorphism (SSCP) analysis (Orita, M., et al. (1989) Proc 
Natl Acad Sci USA 86(8):2766-2770), SSCP/heteroduplex analysis, enzyme 
mismatch cleavage, and direct sequence analysis of amplified exons (Kestila 
et al. (1998) Mol Cell 1(4):575-582; Yuan et al. (1999) Hum Mutat 14(5):440- 
446). Automated methods can also be applied to large-scale 

10 characterization of single nucleotide polymorphisms (Brookes (1999) Gene 
234(2): 177-1 86; Wang et al. (1998) Science 280(5366): 1077-82). The 
present invention further provides assays to detect a mutation of a variant 
MS4A locus by methods such as allele-specific hybridization (Stoneking et 
al. (1991) Am J Hum Genet 48(2):370-82), or restriction analysis of amplified 

1 5 genomic DNA containing the specific mutation. 

IV. Recombinant Production of a MS4A Polypeptide 

The present invention also provides a method for recombinant 
production of a MS4A polypeptide, as described in Example 3. Preferably, 
20 the recombinant polypeptide comprises some or all of the amino acid 
sequences of any one of the even-numbered SEQ ID NOs:2-38. 

Recombinantly produced proteins are useful for a variety of purposes, 
including structural determination of a MS4A polypeptide, generation of an 
antibody that recognizes a MS4A polypeptide, and screening assays to 
25 identify a chemical compound or peptide that interacts with a MS4A 
polypeptide, described further herein below. 

V. Production of MS4A Antibodies 

In another aspect, the present invention provides a method of 
30 producing an antibody immunoreactive with a MS4A polypeptide, the 
method comprising recombinantly or synthetically producing a MS4A 
polypeptide, or portion thereof, to be used as an antigen. The MS4A 
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polypeptide is formulated so that it is can be used as an effective 
immunogen. An animal is immunized with the formulated MS4A 
polypeptide, generating an immune response in the animal. The immune 
response is characterized by the production of antibodies that can be 
5 collected from the blood serum of the animal. Optionally, cells producing a 
MS4A antibody can be fused with myeloma cells, whereby a monoclonal 
antibody can be selected. Exemplary methods for producing a monoclonal 
antibody that recognizes a MS4A protein are described in Example 4. 
Preferred embodiments of the method use a polypeptide set forth as any 

1 0 one of the even-numbered SEQ ID NOs:2-38. 

The present invention also encompasses antibodies and cell lines 
that produce monoclonal antibodies as described herein. 

The foregoing antibodies can be used in methods known in the art 
relating to the localization and activity of the MS4A polypeptide sequences of 

15 the invention, e.g., for cloning of MS4A nucleic acids, immunopurification of 
MS4A polypeptides, imaging MS4A polypeptides in a biological sample, 
measuring levels thereof in appropriate biological samples, and in diagnostic 
methods. 

20 VL Methods for Detecting a MS4A Polypeptide 

In another aspect of the invention, a method is provided for detecting 
a level of MS4A polypeptide using an antibody that specifically recognizes a 
MS4A polypeptide, or portion thereof. In a preferred embodiment, biological 
samples from an experimental subject and a control subject are obtained, 

25 and MS4A polypeptide is detected in each sample by immunochemical 
reaction with the MS4A antibody. More preferably, the antibody recognizes 
amino acids of any one of the even-numbered SEQ ID NOs:2-38, and is 
prepared according to a method of the present invention for producing such 
an antibody. 

30 In one embodiment, a MS4A antibody is used to screen a biological 

sample for the presence of a MS4A polypeptide. A biological sample to be 
screened can be a biological fluid such as extracellular or intracellular fluid, 
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or a cell or tissue extract or homogenate. A biological sample can also be 
an isolated cell (e.g., in culture) or a collection of cells such as in a tissue 
sample or histology sample. A tissue sample can be suspended in a liquid 
medium or fixed onto a solid support such as a microscope slide. In 
5 accordance with a screening assay method, a biological sample is exposed 
to an antibody immunoreactive with a MS4A polypeptide whose presence is 
being assayed, and the formation of antibody-polypeptide complexes is 
detected. Techniques for detecting such antibody-antigen conjugates or 
complexes are well known in the art and include but are not limited to 
10 centrifugation, affinity chromatography and the like, and binding of a labeled 
secondary antibody to the antibody-candidate receptor complex. 

In one embodiment, an antibody that specifically recognizes a MS4A 
polypeptide can be used to assess the tissue- or cell-distribution of MS4A 
protein, for example, to evaluate CD20 expression during B lymphocyte 

15 development (Figure 9). CD20 expression in B220 + lymphocytes from 
lymphoid tissues of wild type mice was examined by two-color 
immunofluorescence. In bone marrow, three types of B220 + cells were 
detected. The vast majority of B220 hl lymphocytes expressed CD20. 
However, the majority of B220 l0 lymphocytes were CD20-negative. Thus, 

20 CD20 was predominantly expressed by mature B cells. 

CD19 expression is restricted to normal and neoplastic B cells and 
follicular dendritic cells. CD19 is expressed early by B progenitor cells in the 
bone marrow, presumably at the late pro-B or early pre-B cell stages around 
the time of immunoglobulin heavy chain rearrangement (Anderson et al. 

25 (1984) Blood 63:1424). Expression persists during all stages of B cell 
maturation and is lost upon terminal differentiation to plasma cells. 

Double staining of CD20 with IgM and CD19 antibodies showed that 
some of the CD19 10 and IgM 10 cells were CD20 negative in the bone marrow. 
A few IgM- cells also expressed low levels of CD20 in the bone marrow. 

30 This data suggested that the CD20 expression was later than the CD19 
expression but before or around the time of IgM expression during B cell 
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development in the bone marrow since these cells were gated on 
lymphocytes not dendritic cells. 

Th e | eve | D f CD20 expression observed on mature B220 hi B cells in 
bone marrow was maintained by B cells from peripheral lymphoid tissues. 
5 The vast majority of B220 + B cells in the spleen, blood, peripheral lymph 
nodes, and peritoneal cavity expressed CD20. Therefore, like human CD20, 
mouse CD20 was also exclusively expressed on B cells from the immature B 
cell stage to mature B cells. 

10 V1L Identification of MS4A Modulators 

VILA. Screening for Small Molecule Liqands that Interact with a 

MS4A Polypeptide 
The present invention further discloses a method for identifying a 
compound that modulates MS4A function. According to the method, a 
15 MS4A polypeptide is exposed to a plurality of compounds, and binding of a 
compound to the isolated MS4A polypeptide is assayed. A compound is 
selected that demonstrates specific binding to the isolated MS4A 
polypeptide. Preferably, the MS4A polypeptide used in the binding assay of 
the method includes some or all amino acids of any one of the even- 
20 numbered SEQ ID NOs:2-38. 

Several techniques can be used to detect interactions between a 
protein and a chemical ligand without employing an in vivo ligand. 
Representative methods include, but are not limited to, Fluorescence 
Correlation Spectroscopy, Surface-Enhanced Laser Desorption/lonization 
25 Time-Of-flight Spectroscopy, and Biacore technology, as described in 
Example 5. These methods are amenable to automated, high-throughput 
screening. 

Candidate regulators include but are not limited to proteins, peptides, 
and chemical compounds. Structural analysis of these selectants can 
30 provide information about ligand-target molecule interactions that enable the 
development of pharmaceuticals based on these lead structures. 
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Similarly, the knowledge of the structure a native MS4A polypeptide 
provides an approach for rational drug design. The structure of a MS4A 
polypeptide can be determined by X-ray crystallography or by computational 
algorithms that generate three-dimensional representations. See Huang et 
5 al. (2000) Pac Symp Biocomput 230-41; Saqi et al. (1999) Bioinformatics 
15:521-522. Computer models can further predict binding of a protein 
structure to various substrate molecules, that can be synthesized and tested. 
Additional drug design techniques are described in U.S. Patent Nos. 
5,834,228 and 5,872,011. 

10 Vil.B. Methods for Identifying Modulators of MS4A Gene Expression 

The assembly and annotation of genomic sequences comprising 
MS4A genes in the region of human chromosome 1 1 q1 2-13.1 , disclosed 
herein for the first time, identify MS4A gene regulatory regions. Preferably, 
MS4A gene regulatory regions comprise sequences upstream of the initial 

15 coding region of each MS4A gene as disclosed in SEQ ID NOs:73-81. An 
expression cassette comprising a MS4A promoter region can be employed 
in assays for the identification of modulators of MS4A expression. Thus the 
present invention also provides a method for identifying a substance that 
regulates MS4A gene expression using a chimeric gene that includes an 

20 isolated MS4A gene promoter region operably linked to a reporter gene. 
According to this method, a gene expression system is established that 
includes the chimeric gene and components required for gene transcription 
and translation so that reporter gene expression is assayable. To select a 
substance that regulates MS4A gene expression, the method further 

25 provides the steps of using the gene expression system to determine a 
baseline level of reporter gene expression in the absence of a candidate 
regulator; providing one or more candidate regulators to the gene expression 
system; and assaying a level of reporter gene expression in the presence of 
a candidate regulator. A candidate regulator is selected whose presence 

30 results in an altered level of reporter gene expression when compared to the 
baseline level. 

Several molecular cloning strategies can be used to identify 
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substances that specifically bind a MS4A gene cis-regulatory element. In 
one embodiment, a cDNA library in an expression vector, such as the 
Iambda-gt1 1 vector, can be screened for cDNA clones that encode a MS4A 
gene regulatory element DNA-binding activity by probing the library with a 
5 labeled MS4A DNA fragment, or synthetic oligonucleotide (Singh et al. 
(1989) Biotechniques 7:252-261). Preferably, the nucleotide sequence 
selected as a probe has already been demonstrated as a protein binding site 
using a protein-DNA binding assay, as described in Example 9. 

In another embodiment, transcriptional regulatory proteins are 

10 identified using the yeast one-hybrid system (Luo et al. (1996) Biotechniques 
20(4):564-568; Vidal et al. (1996) Proc Natl Acad Sci USA 93(1 9): 1031 5- 
10320; Li & Herskowitz (1993) Science 262:1870-1874). In this case, a cis- 
regulatory element of a MS4A gene is operably fused as an upstream 
activating sequence (UAS) to one, or typically more, yeast reporter genes 

15 such as the lacZ gene, the URA3 gene, the LEU2 gene, the HIS3 gene, or 
the LYS2 gene, and the reporter gene fusion construct(s) is inserted into an 
appropriate yeast host strain. It is expected that the reporter genes are not 
transcriptionally active in the engineered yeast host strain, for lack of a 
transcriptional activator protein to bind the UAS derived from the MS4A gene 

20 promoter region. The engineered yeast host strain is transformed with a 
library of cDNAs inserted in a yeast activation domain fusion protein 
expression vector, e.g. pGAD, where the coding regions of the cDNA inserts 
are fused to a functional yeast activation domain coding segment, such as 
those derived from the GAL4 or VP16 activators. Transformed yeast cells 

25 that acquire a cDNA encoding a protein that binds a cis-regulatory element 
of a MS4A gene can be identified based on the concerted activation the 
reporter genes, either by genetic selection for prototrophy (e.g. LEU2, HIS3, 
or LYS2 reporters) or by screening with chromogenic substrates (e.g., a lacZ 
reporter) by methods known in the art. 

30 The present invention also provides an in vivo assay for discovery of 

modulators of MS4A gene expression. In this case, a transgenic non-human 
animal is made such that a transgene comprising a MS4A gene promoter 
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and a reporter gene is expressed and a level of reporter gene expression is 
assayable. Such transgenic animals can be used for the identification of 
compounds that are effective in modulating MS4A gene expression. 
In vitro or in vivo screening approaches can also survey more than one 
5 modulatable transcriptional regulatory sequence simultaneously. 

VIII. Animal Models 

The present invention further pertains to an animal model of disorders 
associated with a MS4A nucleic acid or polypeptide, including but not limited 

10 to atopic disorders, abnormal target cell development, function, and Ca ++ 
responses. Such a model can be prepared by several methods. Using a 
transgenic approach, knock-out, knock-in, or knock-down mutation of the 
MS4A gene can suppress MS4A function. The present invention also 
teaches that an animal model of a MS4A- related disorder can be prepared 

15 by immunizing an animal with a MS4A polypeptide. The resulting immune 
response in the animal comprises a production of antibodies that specifically 
bind a MS4A polypeptide, thereby disrupting its biological activity. A method 
is also provided for generating an animal model of a MS4A-related disorder 
by administering to an animal a compound that disrupts MS4A expression or 

20 function. Such a compound is discovered by methods disclosed herein. 

VIII.A. Generation of CD20-Deficient Mice 

CD20-deficient mice were generated by targeted disruption of the 
CD20 gene in embryonic stem (ES) cells using homologous recombination, 
as described in Example 6. A targeting vector was generated that replaces 

25 exons encoding part of the second extracellular loop, the 4 th transmembrane 
domain, and the large car boxy l-terminal cytoplasmic domain of CD20 with a 
neomycin resistant gene (Figure 10A-D). Appropriate gene targeting 
generates an aberrant CD20 protein truncated at amino acid position 157 
and fused with an 88 amino acid protein encoded by the Neo r gene promoter 

30 sequence. 

After DNA transfections, 6 of 1 15 Neo-resistant ES cell clones carried 
the targeted allele as determined by Southern blot analysis of EcoR V 
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digested genomic DNA using a 1.5 kb DNA probe (Figure 10D). Appropriate 
targeting was further verified in two clones by Southern analysis of ES cell 
DNA digested with BamH I (>12 kb fragment was reduced to a 6.5 kb band 
in targeted cells), Kpn I (7.2 kb became 5.5 kb), and Ssp I (5.6 kb became 
5 7.0 kb) using the same probe. Cells of one ES cell clone were injected into 
blastocysts that were transferred into foster mothers. Highly chimeric male 
offspring (80-100% according to coat color) bred with C57BL/6 (B6) females 
transmitted the mutation to their progeny (Figure 10E). Mice homozygous 
for disruption of the CD20 gene were obtained at the expected Mendelian 

10 frequency by crossing heterozygous offspring. 

Appropriate targeting of the CD20 gene was further verified by PCR 
analysis of genomic DNA from homozygous offspring (Figure 10F). Wild 
type CD20 mRNA was absent in CD20* ; * mice as confirmed by PCR 
amplification of cDNA generated from splenocytes of CD20" /_ mice (Figure 

15 10G). CD20-deficient mice (CD20" A ) thrived and reproduced as well as their 
wild type littermates and did not present any obvious anatomical or 
morphological abnormalities during the first year of life. 

Absence of cell surface CD20 protein expression in CD20' /_ mice was 
further verified by staining B220 + splenocytes with murine anti-CD20 

20 monoclonal antibodies. Hybridomas producing these antibodies were 
generated using splenocytes from CD20' 7 * mice that were immunized with 
CD20-GFP cDNA-transfected 300.19 cells. Ten hybridomas secreted 
antibodies reactive with 300.19 (Figure 10H) and CHO (Figure 101) cells 
transfected with CD20-GFP cDNA, but not with untransfected CHO or 

25 300.19 cells (Table 6). These antibodies also reacted with CD20 epitopes 
expressed on the cell surface of B220 + splenocytes from wild type mice, but 
not with splenocytes from CD20" A mice (Figure 10J). Therefore, targeted 
mutation of the CD20 gene abrogated cell surface CD20 protein expression. 
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Table 6 

Anti-CD20 Monoclonal Antibodies Generated in CD20 V " Mice 3 

> 









Whole Cell ELISA a 


FACS Analvsis b 




Ah Nflmp 


Vlwl IU 1 lQI 1 1 w 


Isotvoe 


CD20-CHO CHO 


CD20-300.19 300.19 


Spleei 


MB20-1 


MCD20-5 


lqG1, K 


+ 


+ 


+ 


MB20-2 


MCD20-61 


lqG1, K 


+ 


+ 


++ 


MB20-3 


MCD20-86 


lqG3, K 


+ 


+ 


++ 


MB20-6 


MCD20-223 


lqG2a, K 


+ 


+ 


+ 


MB20-7 


MCD20-243 


lqG2b, K 


+ 


+ 


+ 


MB20-8 


MCD20-270 


lgG2b, K 


+ 


+ 


+ 


MB20-10 


MCD20-388 


lgG2b, K 


+ 


+ 


+ 


MB20-1 1 


MCD20-392 


lgG2a, K 


+ 


+ 


+ 


MB20-13 


MCD20-624 


lgG3, K 


+ 


+ 


++ 


MB20-14 


MCD20-642 


lgG1,K 


+ 


+ 


++ 



a Values represent reactivity of the monoclonal antibody with adherent 
5 monolayers of CHO cells either transfected or untransfected with CD20-GFP 
cDNA as assessed by a cell-based ELISA. The monoclonal antibodies did 
not react with GFP cDNA-transfected CHO cells. 

b Cell surface reactivity of the monoclonal antibody with single cell 
suspensions of 300.19 cells either transfected or untransfected with CD20- 
10 GFP cDNA or spleen cells from wild type mice. Values represent relative 
indirect immunofluorescence staining intensity as assessed by flow 
cytometry and shown in figure 10H-J. 
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VIII. B. B Cell Development and Function in CD20~ / ~ Mice 

CD20~ / " mice did not show an obvious propensity for infections during 

their first year of life. They had normal frequencies of IgM" B220 10 pro/pre-B 

cells, lgM + B220 10 immature B cells and lgM + B220 hi mature B cells in the 
5 bone marrow (Figure 11, Table 7). Overall, the number of circulating and 

spleen lgM + B220 + B cells found in CD2CK" mice was increased compared 
with wild type littermates (Table 7). However, an immunohistochemical 
analysis of spleen tissue sections revealed a normal architecture and 
organization of the spleen. In the bone marrow, overall IgM expression was 
10 decreased on immature B cells, yet increased on mature B cells when 
compared with IgM levels expressed by comparable cells in wild type 

littermates. However, overall IgM expression by mature B220 h i B cells in 

the blood, spleen and lymph nodes was slightly lower in CD20- / - mice 
(Figure 11B-D). There were no obvious differences in the size (light scatter 

15 properties) of CD20"/" B cells isolated from bone marrow, blood, lymph 
nodes or spleen when compared with B cells from wild type littermates. 
These data therefore suggest that CD20 plays a functional role in the 
development and tissue localization of B cells. 



20 
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Table 7 



Frequencies and Numbers of B Lymphocytes in CD20' / " Mice 



Tissue 



Phenotype Wild Type CD20 



-/- 



Wild Type CD20 



Bone Marrow B220 lo lgM 



% of B Lymphocytes B cell numbers (x10" 



36 ±2 



34 ±3 



B220 lo lgM + 



19 ±2 



13 ±2 



B220 hi lgM + 



14±2 



16 + 4 



Blood d 



B220 + lgM + 



61 ± 2 



60 ± 3 



3.6 ±0.5 3.9 ±< 



Spleen 



B220 + lgM + 



51 ±6 



53 ± 5 



58 ± 12 



76 ± 



Lymph Nodes 6 B220 + lgM + 26 ± 6 



19±2 



1.2 ±0.3 0.9 ± 



Peritoneum 



B220 + lgM + 



70 ±4 



69 ±5 



2.4 ±0.3 3.1 ± 



B220'°CD5 + 44 ± 4 



15 ± 5 



1.5 ±0.2 0.7 ±( 



B220 hi CD5 



28 ±2 



59 ±3 



** 



1.0 ±0.1 2.7 ±( 



10 



a Values represent mean (± SEM) results obtained from seven 2-month-old of 
wild type controls and 10 CD20" 7 " mice. Numbers represent the percentage 
of lymphocytes (based on side and forward light scatter properties) 
expressing the indicated cell surface markers. 

b B cell numbers were calculated based on the total number of cells 
harvested from the indicated tissues. 

d The values indicate the number of cells/ml. 

e Values represent results from peripheral lymph nodes pairs. 

*The percentage or number was significantly different than in wild-type, p < 
0.05; **p<0.01. 
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Within the peritoneal cavity, the number of lgM + B220+ B cells in 

CD20-/" mice was similar to that of wild-type littermates (Table 7, Figure 

HE). However, there was a 4-fold decrease in the number of CD5 + B220'° 

B1a cells, with a compensatory increase in the number of CD5~ B220 hi B2 
5 cells. Therefore, CD20-deficiency predominantly affected the development 
or clonal expansion of the B1 subpopulation of B cells within the peritoneal 
cavity. Exemplary methods for quantitating B cell populations are described 
in Example 7. 

VIII. C. Reduced fCa ++ 1i Responses in CD20" 7 " B Cells 

10 The loss of CD20 significantly altered early B cell signaling 

responses, measured as described in Example 8. Splenic B220 + B cells 

from CD20' 7 " mice generated substantially reduced [Ca ++ ]i responses 
following surface IgM ligation when compared with wild type B cells. 
Decreased [Ca ++ ]i responses in CD20" A B cells were observed in response 
15 to both optimal (40 (jg/ml, Figure 12A) and suboptimal concentrations (5 
pg/ml) of anti-IgM antibodies. Although the kinetics of [Ca ++ ]i responses in 

CD20~'~ B cells was not altered, the magnitude of both the immediate [Ca ++ ]i 
increase and the sustained increase observed at later time points were 
inhibited by loss of CD20 expression. More dramatic decreases in [Ca ++ ]i 
20 responses (>50%) by CD20 _/ ~ B cells were observed in response to CD19 
ligation with optimal concentrations (40 pg/ml) of antibody (Figure 12A). 

Reduced [Ca ++ ]i responses following CD19 ligation on CD20 _/ ' B cells were 
likely to result from differences in signaling capacity since Thapsigargin- 
induced (Figure 12A) and lonomycin-induced [Ca ++ ]i responses were higher 
25 in CD20' /_ B cells than in wild type B cells. In addition, CD19 expression 
levels were not significantly different between CD20~ /_ and wild type B cells 
(Figure 12A). 

Chelation of extracellular calcium with EGTA reduced the kinetics and 
magnitude of the immediate [Ca ++ ]i increase observed following IgM 
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crosslinking (Figure 12A). However, the [Ca ++ ]i increase observed at later 
time points was not substantially inhibited by EGTA treatment. Similar 
results were observed in CD20" 7 " B cells. By contrast, chelation of 

extracellular calcium with EGTA almost eliminated the [Ca ++ ]i response 
5 observed following CD19 crosslinking (Figure 12A). This suggests that 

transmembrane Ca ++ flux contributes substantially to the [Ca ++ ]i responses 
observed following CD19 crosslinking. That CD20-deficiency had a 

substantial effect on CD19-induced [Ca ++ ]i responses suggests that CD20 
can contribute significantly to transmembrane Ca ++ flux. 

10 The consequences of CD20 loss on transmembrane signal 

transduction was further evaluated by assessing total cellular protein 
tyrosine phosphorylation in purified B cells following IgM ligation. Although 
some variation was observed between B cells from individual mice in 
individual experiments, overall levels of tyrosine phosphorylation in resting 

15 splenic B cells were higher in CD20" 7 ' B cells than in wild type mice (Figure 
12C). In addition, protein phosphorylation in B cells from CD20" 7 " mice 
increased more significantly after B cell antigen receptor (BCR) ligation than 
in wild type B cells. Thus, while CD20 expression can influence BCR- 

induced tyrosine phosphorylation, decreased [Ca ++ ]i responses in CD20" 7 " B 
20 cells are unlikely to result from significant abnormalities in transmembrane 
signaling through the BCR. 



IX. Therapeutic Applications 

Another aspect of the present invention is a therapeutic method 
25 comprising administering to a subject a substance that modulates MS4A 
biological activity. Therapeutic substances include but are not limited to 
chemical compounds, antibodies, and gene therapy vectors. Substances 
that are discovered by the methods disclosed herein are useful for 
therapeutic applications related to disorders of MS4A function. 
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In one embodiment, the present invention provides a method for 
disrupting MS4A function by immunizing a subject with an effective dose of 
the disclosed MS4A polypeptide. The immune system of the subject 
produces an antibody that specifically recognizes the MS4A polypeptide, 
5 and binding of the antibody to the MS4A polypeptide abolishes MS4A 
function. 

In another embodiment, the present invention provides MS4A nucleic 
acid sequences and gene therapy methods for modulating MS4A activity in a 
target cell. The gene therapy vector can encode a MS4A or sequences 
10 encoding a nucleic acid molecule, peptide, or protein that interacts with a 
MS4A protein. 

Vehicles for delivery of a gene therapy vector include but are not 
limited to a liposome, a cell, and a virus. Preferably, a cell is transformed or 
transfected with the DNA molecule or is derived from such a transformed or 

15 transfected cell. Alternatively, the vehicle is a virus, including a retroviral 
vector, adenoviral vector or vaccinia virus whose genome has been 
manipulated in alternative ways so as to render the virus non-pathogenic. 
Methods for creating such a viral mutation are detailed in U.S. Patent No. 
4,769,331. Exemplary gene therapy methods are also described in U.S. 

20 Patent Nos. 5,279,833; 5,286,634; 5,399,346; 5,646,008; 5,651,964; 
5,641,484; and 5,643,567. 

The therapeutic methods of the present invention can be applied in 
the treatment of a variety of conditions, including in the treatment of non- 
Hodgkin's lymphoma and in the treatment of atopic disorders or other 

25 allergenic diseases. Application of the present inventive therapeutic 
methods are evidenced by the current U.S. Food and Drug Administration 
approved use of antibodies against CD20 in the treatment of non-Hodgkin's 
lymphoma. Additionally, the therapeutic methods of the present invention 
are illustrated in view of the recognition in the art that genetic variations at 

30 chromosome 11Q12-13 can also play a role in the pathogenesis of atopic 
disorders and other allergenic diseases. Indeed, it has been recognized that 
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FcsRip contributes to such diseases, and thus the MS4A genes identified in 
accordance with the present invention are envisioned also to contribute to 
allergenic disease. Therefore the present therapeutic methods, which 
pertain to the modulation of the biological activity of an MS4A polypeptide of 
5 the present invention have application with respect to the treatment of such 
disorders. 

X. Summary 

The invention comprises 19 new genes that are members of a class 
10 of genes encoding MS4A proteins. Three members have been described, 
CD20, FcsRip, and HTm4. A gene family has been defined based on a 
shared chromosomal location, conservation of protein size and structure, 
gene structure conservation, and similar expression in hematopoietic cells. 
MS4A proteins function as oligomeric cell surface complexes, and complex 
15 assembly using diverse MS4A members is implicated as a mechanism for 
regulating complex function. 

Two members of this class, CD20 and FcsRip, have been described 
functionally, and in each case an important function has been delineated. 
CD20 is required for cell cycle progression and signal transduction in B 
20 lymphocytes. CD20 also regulates Ca ++ conductance, possibly as a cation 
channel subunit. Of clinical relevance, antibodies that recognize CD20 are 
effective in treating non-Hodgkin's lymphoma. FcsRip mediates interactions 
with IgE-bound antigens that lead to degranulation of mast cells, and 
variation of the FcsRip locus is implicated in allergenic disease. 

25 The utility of the MS4A genes is based in part on overlapping or 

shared functions with known MS4A members. In one case, new MS4A 
genes have important potential as part of a CD20 complex. The structural 
description of CD20 complexes suggests that one or more CD20-related 
proteins constitute the functional complex. Thus, new MS4A proteins can 

30 define antigens useful for lymphoma treatment. In another case, MS4A 
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genes are implicated in IgE responses. Atopic disorders (allergy, asthma, 
eczema, allergic rhinitis) are dysfunctional IgE responses and are associated 
with a locus on human chromosome 1 1 q containing most members of the 
MS4A gene family. FcsRip is one relevant factor, and recent work supports 
5 that FceRip as well as other genetic elements in the region contribute to the 
disease. Thus, as disclosed herein, the present MS4A sequences also have 
utility in the characterization, diagnosis, and potential treatment of atopy 
linked to the chromosomal location wherein MS4A genes are located. 



10 Examples 

The following Examples have been included to illustrate modes of the 
invention. Certain aspects of the following Examples are described in terms 
of techniques and procedures found or contemplated by the present co- 
inventors to work well in the practice of the invention. These Examples 
15 illustrate standard laboratory practices of the co-inventors. In light of the 
present disclosure and the general level of skill in the art, those of skill will 
appreciate that the following Examples are intended to be exemplary only 
and that numerous changes, modifications, and alterations can be employed 
without departing from the scope of the invention. 

20 Example 1 

Database Searches and cDNA Isolation 

Three hundred and thirty seven nucleotide sequences obtained from 

i 

the translated GenBank database of expressed sequence tags (ESTs) were 
assembled into sixty-two subgroups of contiguous linear segments based on 

25 their overlapping sequences and potential for encoding proteins homologous 
with CD20. Based on these subgroups, EST cDNAs (Figure 1) were 
obtained from the ATCC and sequenced. Based on the complete 
sequences of twenty-one near full-length EST cDNAs, eleven novel genes 
were defined in human and mouse that unified multiple EST subgroups. 

30 Near full-length EST clones representing these genes are shown in Figure 1. 
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These eleven genes and five additional genes were also identified by PCR 
amplification of transcripts using subgroup-specific primers or primers based 
on EST sequences. The specific details of how cDNAs representing the five 
genes that were not identified by EST cDNA clones are indicated below. In 
5 all cases, ESTs and cDNAs encoding the predicted coding regions of each 
putative unique gene were sequenced in both directions and at least two 
independent ESTs and/or cDNAs representing near full-length gene 
products were sequenced. Thereby, there was independent confirmation of 
accuracy for all of the sequences reported. 

10 Based on EST subgroup sequences, cDNAs encoding mouse 

MS4a4B and MS4a4C were isolated by PCR amplification of C57BL/6 
mouse spleen cDNA using both Taq and Pfu DNA polymerase. Primers for 
MS4a4B (SEQ ID NOs:63-64) amplified an 879 bp fragment. Primers for 
MS4a4C (SEQ ID NOs:65-66) amplified a 794 bp fragment. EST sequences 

15 for MS4a4D only encoded the 3' end of the predicted protein. Since 
MS4a4D sequences were closely related to MS4a4B and MS4a4C 
sequences, a sense 5' primer (SEQ ID NO:67) based on consensus 
MS4a4B and MS4a4C sequences and a MS4a 4D-specific antisense primer 
(SEQ ID NO:68) were used to amplify a 773 bp fragment from cDNA of 

20 C57BL/6 mouse lung. 

MS4a6C was initially identified based on one unique EST sequence 
(AA028258) encoding a mouse protein homologous with the C-terminal end 
of MS4a6B. MS4a6C cDNAs were isolated by PCR amplification of 
C57BL/6 mouse bone marrow cDNA using Taq polymerase. A primer based 

25 on identical sequences at the 5' end of the MS4a6B and MS4a6D cDNAs 
(SEQ ID NO:69) was used in combination with an antisense primer specific 
for the unique EST sequence (SEQ ID NO:70) to amplify a 787 bp fragment. 
Sequences from multiple independent PCR-amplified cDNAs were identical. 
Subsequently, the PCR-generated 5* end of the near full-length MS4a6C 

30 cDNA was found to be identical to an orphan EST subgroup sequence that 
had not been linked with defined 3' sequences. Thereby, the EST subgroup 
sequences verified that the PCR-amplified 5' end of the MS4a6C cDNAs 
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was appropriate. In addition, the overall MS4a6C sequence was similar to 
the sequence of MS4a6B cDNAs without interruption. Thus, the MS4a6C 
cDNA united sequences identical to those found in two non-overlapping 
CD20-homologous EST subgroups. cDNAs encoding a 473 bp fragment of 
5 mouse MS4a3 were amplified from cDNA of C57BL/6 bone marrow as 
described above. Primers (SEQ ID NOs:71-72) were obtained based on a 
single thymic cDNA EST sequence (GenBank AA940479) where the 
corresponding cDNA was not available. 

Human MS4A and mouse MS4a cDNA sequences (MS4A1 to 
10 MS4A12) (disclosed herein) were used to search the htgs GenBank human 
genomic database of unfinished human genomic sequences 
(http://www.ncbi.nlm.nih.gov/blast/) using the BLAST program. Seventeen 
phase 1 or phase 2 human genomic DNA sequences encoding potential 
MS4A genes were assembled into groups of contiguous linear segments 
15 based on their overlapping sequences. Three EST clones corresponding to 
partial MS4A6E transcripts were obtained from the ATCC and sequenced 
completely on both DNA stands. 

All PCR-amplified cDNAs were subcloned and sequenced entirely in 
both directions. Complete sequencing of at least two distinct PCR- 

20 generated cDNAs from both Taq and Pfu enzyme was performed in most 
cases. Differences between cDNA sequences were only noted when 
multiple cDNA clones generated by both Taq and Pfu polymerases revealed 
identical differences. In some cases, cDNAs or EST sequences contained 
potential intronjexon splice sites that delimited structural domains and 

25 aligned with the known intron|exon splice sites of CD20 (Tedder et al. 
(1989b) J Immunol 142:2560-2568). In these cases, potential introns were 
flanked by consensus splice donor and/or splice acceptor sequences (Aebi 
& Weissmann (1987) Trends Genet 3:102-107) or were likely to represent 
splice variants where exons were deleted. 
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Example 2 

RNA Isolation and Reverse Transcription-PCR 

Reverse transcription-PCR amplification (RT-PCR) was as described 
previously (Zhou & Tedder, 1995) with minor modifications. Total RNA was 
5 extracted from 1-2 x 10 7 hematopoietic cell lines using a RNeasy Mini Kit 
(Qiagen, Inc., Chatsworth, California) according to the manufacturer's 
instructions. Human hematopoietic cell lines included one pre-B cell line 
(NALM-6), three B cell lines (BJAB, DAUDI, and SB), four T cell lines (HSB- 
2, HUT-78, JURKAT, and MOLT15), two myelomonocytic lines (HL60 and 

10 U937), and one erythroleukemia cell line (K562). RNA concentrations were 
determined by UV absorbance. Ten jLig of total RNA was reverse 
transcribed. In some cases, cDNA from any of 8 different human tissues 
(colon, ovary, blood mononuclear cells, prostate, small intestine, spleen, 
testes, and thymus; from CLONETECH Laboratories, Inc., Palo Alto, 

15 California) was analyzed. RT-PCR amplification was performed using gene- 
specific primers identical with protein coding regions of the predicted MS4A 
genes during 35 cycles (94°C for 1 min, 55°C for 1.5 min, 72°C for 1.5 min, 
followed by extension at 72°C for 5 min). Following amplification, the PCR 
products were separated on 1% agarose-ethidium bromide gels and 

20 photographed. G3PDH, a housekeeping gene, was also amplified to control 
for sample to sample variation. RNA amplified without reverse transcription 
was used as a negative control, and was negative in all cases. 

Example 3 

Recombinant Production of MS4A Protein 

25 For recombinant production of a protein of the invention in a host 

organism, a nucleotide sequence encoding the protein is inserted into an 
expression cassette designed for the chosen host and introduced into the 
host where it is recombinantly produced. The choice of the specific 
regulatory sequences such as promoter, signal sequence, 5' and 3' 

30 untranslated sequence, and enhancer appropriate for the chosen host is 
within the level of ordinary skill in the art. The resultant molecule, containing 
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the individual elements linking in the proper reading frame, is inserted into a 
vector capable of being transformed into the host cell. Suitable expression 
vectors and methods for recombinant production of proteins are well known 
for host organisms such as E. coli, yeast, and insect cells (see, e.g., 
5 Lucknow & Summers (1988) Bio/Technol 6:47). Additional suitable 
expression vectors are baculovirus expression vectors, e.g., those derived 
from the genome of Autographies californica nuclear polyhedrosis virus 
(AcMNPV). 

Recombinantly produced proteins are isolated and purified using a 
10 variety of standard techniques. The actual techniques used varies 
depending upon the host organism used, whether the protein is designed for 
secretion, and other such factors. Such techniques are well known to the 
skilled artisan. See Ausubel et al. (1994). 

Example 4 

15 Mouse Anti-Mouse CD20 Monoclonal Antibody Production 

Hybridomas producing CD20-specific mouse monoclonal antibodies 
were generated by the fusion of NS-1 myeloma cells with spleen cells from a 
CD20~ / " mouse immunized with a cell line expressing a mouse CD20-GFP 
fusion protein. The CD20-GFP fusion protein was generated by subcloning 

20 a fragment of the pmB1-1 cDNA (from 159 to 1050 bp of SEQ ID NO:39) into 
the PEGFP-N1 vector (Clonetech Laboratories Inc., Palo Alto, California) to 
generate an open reading frame encoding the entire CD20 protein with GFP 
fused to the carboxyl-terminal end. The resulting plasmid was linearized 
with ApaL I and used to transfect 300.19 cells, a mouse pre-B cell line, and 

25 Chinese Hamster Ovary (CHO) cells. Transfection was by Lipofectamine 
following the manufacturer's instructions (Clonetech Laboratories, Inc.). 
Transfected cells were selected using GENETICIN™ (1 mg/ml, GIBCOBRL) 
in RPMI 1640 media (Sigma) for 300.19 cells or H-12 nutrient mixture 
(GIBCOBRL) for CHO cells. Both media were supplemented with 10% FCS, 

30 L-glutamine, streptomycin and penicillin. Transfected cells expressing high 
levels of CD20-GFP were isolated by fluorescence-based cell sorting. 
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Example 5 

In vitro Binding Assays 

Recombinant protein can be obtained, for example, according to the 
approach described in Example 4 herein above. The protein is immobilized 
5 on chips appropriate for ligand binding assays. The protein immobilized on 
the chip is exposed to sample compound in solution according to methods 
well known in the art. While the sample compound is in contact with the 
immobilized protein, measurements capable of detecting protein-ligand 
interactions are conducted. Measurement techniques include, but are not 
10 limited to, SEDLI, Biacore, and FCS, as described above. Compounds 
found to bind the protein are readily discovered in this approach and are 
subjected to further characterization. 

Example 6 

Generation of CD20-Deficient Mice 

15 DNA encoding the CD20 gene was isolated from a phage library 

prepared from 129/Sv strain mouse DNA (Figure 10A), mapped with 
restriction endonucleases, and sequenced to identify intron|exon boundaries 
(Figure 10B). The targeting vector was constructed using a pBluescript SK 
(Stratagene, La Jolla, California)-based targeting vector (p594, provided by 

20 Dr. David Milstone, Brigham and Women's Hospital, Boston, 
Massachusetts). A DNA fragment starting at the Pst I site in CD20 exon 5 
through the EcoR V site in exon 6 (-1.8 kb) was isolated and blunt end 
ligated into the targeting vector downstream of the pMC1-HSV thymidine 
kinase gene and upstream of the neomycin resistance marker obtained from 

25 pGK-neo poly A (Stratagene) that contained the PGK promoter and poly A 
signal sequence. An -10 kb DNA fragment beginning at the Kpn I site 
downstream of exon 8 was also isolated and inserted into the targeting 
vector downstream of the neomycin resistant gene. The plasmid was 
linearized using a unique Sal I restriction site proximal to the 3' end of the 

30 CD20 gene insert and used to transfect ES cells. 
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ES ceils were transfected with linearized plasmid DNA and selected 
for G418 resistance as described (Keller and Smithies (1989) Proc Natl 
Acad Sci USA 886:8932). Genomic DNA from individual selected clones 
was digested with EcoR V and used for Southern blot analysis along with a 
5 radiolabeled -1.5 kb DNA probe that was external to the targeting vector 
(Figure 10D). A 4.6 kb genomic DNA fragment hybridized with the probe in 
wild type ES cells or a 6.3 kb fragment in appropriately targeted ES cells (Fig 
1E). Genomic DNA generated by BamH I, Ssc I or Kpn I digestion was also 
analyzed for appropriate targeting. The Southern blot pattern obtained in all 

10 cases was consistent with the appropriate predicted mutation indicating that 
detrimental recombinations did not occur in the vicinity of the desired 
homologous recombination. Cells from appropriately targeted ES cell clones 
were injected into 3.5 day old C57BL/6 blastocysts that were transferred into 
foster mothers. Offspring carrying the mutant CD20 allele were identified by 

15 Southern blot analysis of DNA obtained from tail biopsies. 

High chimeric males (80-100% according to color) were bred with 
C57BL/6 (B6) females to generate heterozygous offspring with germline 
gene transmission, which were crossed to generate the homozygous CD20" 7 " 
and wild type littermates used for this study. In some cases, B6/129F1J 

20 (Jackson Laboratory) were used as controls. Results obtained using wild 
type littermates of CD20 +/ " mice were similar and were therefore pooled. All 
mice were between 2-3 months of age when used for this study. Mice were 
housed in a specific pathogen-free barrier-facility. All studies and 
procedures were approved by the Animal Care and Use Committee of Duke 

25 University. 

Example 7 

Flow Cytometric Analysis of Lymphocyte Subsets 

Single cell suspensions of lymphocytes from the spleen, bone 
marrow, peripheral lymph nodes, and peritoneal cavity were isolated from 
30 CD20" A and wild type mice and counted using a hemocytometer prior to two- 
color immunofluorescence analysis. Retroorbital venous plexus puncture 
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was utilized to obtain circulating leukocytes. Leukocytes (0.5 x 10 6 ) were 
stained at 4°C using predetermined optimal concentrations of the test 
monoclonal antibody for 20 min as described (Zhou et al. (1994) Mol Cell 
Biol 14:3884-3894). Blood erythrocytes were lysed after staining using the 
5 Coulter Whole Blood Immuno-Lyse kit as detailed by the manufacturer 
(Coulter, Inc., Miami, Florida). Cells were washed and analyzed on a 
FACScan flow cytometer (Becton Dickinson, San Jose, California). 

Antibodies used in this study included the following: biotin, FITC- 
conjugated anti-B220 Mab (CD45RA, RA-3, 6B2, provided by Dr. Robert 

10 Coffman, DNAXCORP, Palo, Alto, California); PE-conjugated anti-mouse 
Thy1.2 (Caltag Laboratories, Burlingame, California); B220-PE (Caltag 
Laboratories, Burlingame, California); biotin-conjugated anti-l-A (BD 
PharMingen, Franklin Lakes, New Jersey); PE or APC-conjugated anti-CD5 
(BD PharMingen); PE-conjugated goat anti-mouse lgG3-specific antibody 

15 (Southern Biotechnology Associates Inc., Birmingham, Alabama); and biotin- 
conjugated anti-mouse IgD (Southern Biotechnology Associates Inc., 
Birmingham, Alabama). FITC or biotin-conjugated goat anti-mouse IgM 
isotype-specific antibodies (Southern Biotechnology Associates Inc., 
Birmingham, Alabama) were also used. 

20 Phycoerythrin-conjugated Streptavidin (Southern Biotechnology 

Associates Inc., Birmingham, Alabama) was used to reveal biotin-coupled 
monoclonal antibody staining. The percent positively stained lymphocytes 
was determined using a FACScan flow cytometer (Becton Dickinson, San 
Jose, California). Positive and negative populations of cells were 

25 determined by using unreactive monoclonal antibody (Caltag Laboratories, 
Burlingame, California) as controls for background staining. Background 
levels of staining were delineated using gates positioned to include 98% of 
the control cells. Ten thousand cells with the forward and side light scatter 
properties of lymphocytes were analyzed for each sample. 
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Example 8 

Intracellular Calcium Measurements 

Changes in lymphocyte [Ca 2+ ]j levels were monitored by flow 
cytometry analysis as described (Fujimoto et al. (1999) Immunity 11:191). 
5 Single cell suspension of splenocytes were resuspended (1 x 10 7 /ml) in 
RPMI 1640 medium containing 5% FBS, 10 mM HEPES and loaded with 
1|nM of indo-1-AM for 30 min at 37°C. Splenocytes were then washed and 
incubated with a predetermined optimal concentration of FITC-conjugated 
anti-B220 monoclonal antibody for 15 min at room temperature. The 

10 splenocytes were washed again and resuspended at 2 x 10 6 /ml in medium. 
The fluorescence ratio (405/525 nm) of B220 + splenic B cells was monitored 
by flow cytometry at baseline for 1 min and for 6 min after stimulation with 
optimal and suboptimal concentrations of goat F(ab')2 anti-IgM antibody (5- 
40 jag/ml), optimal concentrations of anti-mouse CD19 monoclonal antibody 

15 (40 |xg/ml), Thapsigargin (1 |ag/ml; Sigma), or lonomycin (2.67 jag/ml; 
Calbiochem Biosciences, Inc., La Jolla, California). In some cases, EGTA (5 
mM final; pH 7.0) was added to the cells, immediately followed by 
stimulation with the inducing agents described above. Results were plotted 
as the fluorescence ratio at 20 sec intervals with background fluorescence 

20 subtracted. An increase in the fluorescence ratio indicates an increase in 
[Ca 2+ ]i. 

Example 9 

Characterization of a MS4A Promoter Region 
A preferred in vitro technique for evaluating MS4A promoter function 
25 is a transient transfection assay. According to this method, one or more 
chimeric reporter genes comprising a MS4A promoter region is introduced 
into a relevant host cell (e.g., a hematopoietic cell), and the resulting level of 
reporter gene expression is quantitated. Representative methods for making 
an expression system comprising a promoter region operably linked to a 
30 heterologous reporter sequence are disclosed in U.S. Patent No. 6,087,1 11. 

To analyze the function of a MS4A promoter region in vivo, transgenic 
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mice bearing a chimeric gene comprising a MS4A promoter region are 
generated, and a level of reporter gene expression in each mouse is 
determined. 

Within a candidate promoter region or response element, the 
5 presence of regulatory proteins bound to a nucleic acid sequence can be 
detected using a variety of methods well known to those skilled in the art 
(Ausubel et al., 1992). Briefly, in vivo footprinting assays demonstrate 
protection of DNA sequences from chemical and enzymatic modification 
within living or permeabilized cells. Similarly, in vitro footprinting assays 

10 show protection of DNA sequences from chemical or enzymatic modification 
using protein extracts. Nitrocellulose filter-binding assays and gel 
electrophoresis mobility shift assays (EMSAs) track the presence of 
radiolabeled regulatory DNA elements based on provision of candidate 
transcription factors. Computer analysis programs, for example TFSEARCH 

15 version 1.3 (Yutaka Akiyama: "TFSEARCH: Searching Transcription Factor 
Binding Sites", http://www.rwcp.or.jp/papia/), can also be used to locate 
consensus sequences of known cis-regulatory elements within a genomic 
region. 



20 
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CLAIMS 



What is claimed is: 



1. An isolated MS4A polypeptide, or functional portion thereof, 
comprising: 

5 (a) a polypeptide encoded by the nucleotide sequence of 

anyone of the odd-numbered SEQ ID NOs:1-37; 

(b) a polypeptide encoded by a nucleic acid molecule that is 
substantially identical to any one of the odd-numbered 
SEQ ID NOs:1-37; 

10 (c) a polypeptide having the amino acid sequence of any 

one of the even-numbered SEQ ID NOs:2-38; 

(d) a polypeptide that is a biological equivalent of the 
polypeptide of any one the even-numbered SEQ ID 
NOs:2-38; or 

15 (e) a polypeptide which is immunologically cross-reactive 

with an antibody that shows specific binding with a 
polypeptide of any one of the even-numbered SEQ ID 
NOs:2-38. 

2. An isolated nucleic acid molecule encoding a MS4A 
20 polypeptide, comprising: 

(a) the nucleotide sequence of any one of the odd- 
numbered SEQ ID NOs:1-37; or 

(b) a nucleic acid molecule substantially identical to any 
one of the odd-numbered SEQ ID NOs:1-37. 

25 3. The isolated nucleic acid molecule of claim 2, comprising a 20 

nucleotide sequence that is identical to a contiguous 20 nucleotide sequence 
of anyone of the odd-numbered SEQ ID NOs:1-37. 

4. A chimeric gene, comprising the nucleic acid molecule of claim 
2 operably linked to a heterologous promoter. 
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5. A vector comprising the chimeric gene of claim 4. 

6. A host ceil comprising the chimeric gene of claim 4. 

7. The host cell of claim 6, wherein the cell is selected from the 
group consisting of a bacterial cell, a hamster cell, a mouse cell, and a 

5 human cell. 

8. A method of detecting a nucleic acid molecule that encodes a 
MS4A polypeptide, the method comprising: 

(a) procuring a biological sample comprising nucleic acid 
material; 

10 (b) hybridizing the nucleic acid molecule of claim 2 under 

stringent hybridization conditions to the biological 
sample of (a), thereby forming a duplex structure 
between the nucleic acid of claim 2 and a nucleic acid 
within the biological sample; and 

15 (c) detecting the duplex structure of (b), whereby a MS4A 

nucleic acid molecule is detected. 

9. An antibody that specifically recognizes a MS4A polypeptide of 
claim 1. 

10. A method for producing an antibody that specifically 
20 recognizes a MS4A polypeptide, the method comprising: 

(a) recombinantly or synthetically producing a MS4A 
polypeptide, or portion thereof; 

(b) formulating the polypeptide of (a) whereby it is an 
effective immunogen; 

25 (c) administering to an animal the formulation of (b) to 

generate an immune response in the animal comprising 
production of antibodies, wherein antibodies are present 
in the blood serum of the animal; and 
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(d) collecting the blood serum from the animal of (c), the 
blood serum comprising antibodies that specifically 
recognize a MS4A polypeptide. 

11. A method for detecting a level of MS4A polypeptide, the 
5 method comprising 

(a) obtaining a biological sample comprising peptidic 
material; and 

(b) detecting a MS4A polypeptide in the biological sample 
of (a) by immunochemical reaction with the antibody of 

10 claim 9, whereby an amount of MS4A polypeptide in a 

sample is determined. 

12. A method for identifying a substance that modulates MS4A 
function, the method comprising: 

(a) isolating a MS4A polypeptide of claim 1 ; 

1 5 (b) exposing the isolated MS4A polypeptide to a plurality of 

substances; 

(c) assaying binding of a substance to the isolated MS4A 
polypeptide; and 

(d) selecting a substance that demonstrates specific 
20 binding to the isolated MS4A polypeptide. 

13. A method for modulating MS4A function in a subject, the 
method comprising: 

(a) preparing a pharmaceutical composition, comprising a 
substance identified according to the method of claim 10 

25 or 12, and a carrier; and 

(b) administering an effective dose of the pharmaceutical 
composition to a subject, whereby MS4A activity is 
altered in the subject. 
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14. The method of claim 13, wherein the substance is an antibody, 
a protein, a peptide, or a chemical compound. 

15. The method of claim 13, wherein MS4A activity is regulation of 
the abundance of target cell subpopulations. 

5 16. The method of claim 13, wherein MS4A activity is regulation of 

[Ca 2+ ]j levels. 

17. A method for identifying a candidate compound as a modulator 
of MS4A gene expression, the method comprising: 

(a) exposing a cell sample with a candidate compound to be 
10 tested, the cell sample containing at least one cell containing a 

DNA construct comprising a modulatable transcriptional 
regulatory sequence of a MS4A-encoding nucleic acid and a 
reporter gene which is capable of producing a detectable 
signal; 

15 (b) evaluating an amount of signal produced in relation to a control 

sample; and 

(c) identifying a candidate compound as a modulator of MS4A 
gene expression based on the amount of signal produced in 
relation to a control sample. 

20 18. The method of claim 17, wherein the modulatable 

transcriptional regulatory sequence of a MS4A-en coding nucleic acid 
comprises a sequence that is immediately upstream of the initial coding 
region of a MS4A gene as set forth in any one of SEQ ID NOs:73-81 . 

19. A method for modulating MS4A function in a subject, the 
25 method comprising: 

(a) preparing a gene therapy vector having a nucleotide 
sequence encoding a MS4A polypeptide or a nucleotide 
sequence encoding a nucleic acid molecule, peptide, or 
protein that interacts with a MS4A nucleic acid or 
30 polypeptide; and 
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administering the gene therapy vector to a subject, 
whereby the function of MS4A in the subject is 
modulated. 
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Figure 3 



A4E ATCCGGGAAA TG GGGGCT GTAGGAGAAT GGTGTCTGAG AGAAGACAGA AAC TGTTAGT 

A4A ATGTGGGCAA TGGTGGGGAA GTGGGGGAAT AGTGTCTGAG AGAAGATAGA GACTATAAAA TGCTGTTCAT 

A4E GTGTGAGAAG ATTAGATGCT GTGGTGGGTT TGTTGCTATT CTTGTAAGGT GGTCATGGCC GGGGGAGTGG AAC 3 

A4 A GTGTGAGAAG ATTAGATGAT GTGGTGGGTT TGTTTTTGTT CTTACAAGCT GGTCA CA AGAGGGGTGG \KKd\ 3 

A4E ATT CCT GCA AAC CGG CTC AAC ATA TGC AAA TGT CTC AAT ATA GGA ATG AAA TTA CTT 60 

A4A ATT CCT GCA AAT GGT TTC AAT ATA TGC AGA TGT CTC GAT ATA GGA ATG AAA TTA CGT CTT 63 

A4E TAG AAC AAC TTA AAT AAG TCC AAT ATA CTT GGG GCT TTA AAA AAT AAA AAG GAG AAA TTC 120 

A4A TGG AAC AAC TTA AAT AAG TCA AAT ATA CTT GGA GCT TTA AA AAT TAA AAG GAG AGA GAT 122 

A4E MTTMQGMEQTTPG 

A4E AAG | AGC ACC TTT TCT GCT GCC ATG ACA ACC ATG CAA GGA ATG CAA CAG ACC ACT CCA GGG 180 

A4A TCG AGjC ACC TTT TCT GCT GCC ATG ACA ACC ATG CAA GGA ATG GAA CAG GCC ATG CCA GGG 182 

A4EPGPDVPQLGNIDVIHSYLCK 

A4E CCT GGC CCT GAT GTG CCC CAG CTC GGA AAC ATA GAT GTC ATA CAT TCA TAT CTG TGT AAA 240 

A4 A GCT GGC CCT GGT CTG CCC CAG CTG GGA AAC ATG CCT GTC ATA CAT TCA CAT CTG TGG AAA 142 

A4E GLQEKFFKRKPKVLGVVRI L 

A4E GCA TTG CAA GAG AAG TTC TTC AAG AGG AAA CCC AAA GTC CTT GGG } GTT GTG CGG ATT CTC 300 

A4A GGA TTG CAA GAG AAG TTC TTG AAG GGA GAA CCC AAA GTC CTT GGG I GTT GTG CAG ATT CTG 302 

A4E IALMSLSMCIIMMCVAFSSY 

A4E ATT GCC TTG ATG AGC CTT AGC ATC GGA ATA ATA ATG ATG TGT GTT GCA TTT AGT TCT TAT 360 

A4A ACT GCC CTG ATG AGC CTT AGC ATC GGA ATA ACA ATG ATG TGT ATG GCA TCT AAT ACT TAT 3 62 

A4EEEHPIFVYVAYTIWGSVMF I 

A4E GAA GAA CAT CCC ATT TTT GTG TAT GTT GCG TAC ACA ATT TGG GGG TCA GTG ATG I TTT ATT 420 

A4A GGA AGT AAC CCT ATT TCC GTG TAT ATC GGG TAC ACA ATT TGG GGG TCA GTA ATG 1 TTT ATT 422 

A4E ISVSLSVAAGIRTTKGLVGG 

A4E ATT TCA GTA TCC TTA TCA GTT GCA GCA GGA ATT AGA ACA ACA AAA GGT CTG {GTT GGA GGT 480 

A4A ATT TCA GGA TCC TTG TCA ATT GCA GCA GGA ATT AGA ACT ACA AAA GGC CTG | GTC CGA GGT 4 82 

A4E SLGKNITSSVfiAISGILINA 

A4E AGT CTA GGA AAG AAT ATC ACC AGT TCA GTC TTG GCT ATA TCA GGG ATC TTA ATC AAT GCA 540 

A4A AGT CTA GGA ATG AAT ATC ACC AGC TCT GTA CTG GCT GCA TCA GGG ATC TTA ATC AAC ACA 542 

A4E I S LTFY S FRYHYCNHDQLS S 

A4E ATA AGC TTG ACG TTT TAT TCA TTC CGT TAC CAT TAC TGT AAC CAC GAT CAG TTG TCA AGT 600 

A4A TTT AGC TTG GCG TTT TAT TCA TTC CAT CAC CCT TAC TGT AAC TAC TAT GGC AAC TCA AAT 602 

A4E NCYMTMSILMGTDGMVLL LS 

A4E AAT TGT TAC ATG ACT ATG TCC ATT TTA ATG {GGT ACG GAT GGC ATG GTG CTC CTC TTA AGT 660 

A4A AAT TGT CAT GGG ACT ATG TCC ATC TTA ATG { GGT CTG GAT GGC ATG GTG CTC CTC TTA AGT 662 

A4E VLEFCIAVSLSAFGCKVLCC 

A4E GTG CTG GAA TTC TGC ATT GCT GTG TCC CTC TCT GCC TTT GGA TGT AAA GTG CTT TGT TGT 720 

A4A GTG CTG GAA TTC TGC ATT GCT GTG TCC CTC TCT GCC TTT GGA TGT AAA GTG CTC TGT TGT "722 

A4E SPSEFVLITPSNSHMAEIAS 

A4E AGC CCC AGT GAG {TTT GTG CTA ATT ACA CCA TCA AAT TCT CAC ATG GCA GAA ATA GCA TCT 780 

A4A ACC CCT GGT GGG (GTT GTG TTA ATT CTG CCA TCA CAT TCT CAC ATG GCA GAA ACA GCA TCT 782 

A4E PTPLKT V" 

A4E CCC ACA CCA CTT AAG ACA GTT TGA TGCCACCAAA AGATTAACAG AAGAATGCTC CAGAAATCTA 84 4 

A4A CCC ACA CCA CTT AAT GAG GTT TGA GGCCACCAAA AGATCAACAG ACAAATGCTC CAGAAATCTA 846 

A4E TGCTGACTGT AACACAAGAA CC CACATGA GAAAGTACCA GAATCCAACT CCAATACTGA TAGACATATT 913 
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A4A GACACATftAT AAAfTTCAAAA TTATCTTCTC ATTTTTTTCC 1001 
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A4A CTGGAACTCA ATAACTCATT TCACTCGCTC TTTATCGAGA GTACTAGAAG TTAAATT fiAT™ 'AAAf TAATGCA 1071 

TTAGGACTTT ATATAAAGGC ATTAAACTGGC 1194 

TAAGAACTTT ATATAAAGGC ATTACATTGGC 1142 

CTAAAACGAG GTCTCCATGC AAA CACATA 1259 

TTAAAATGAG GCCTCCATGC AAAACACATA 1212 

TTTTTTTCTG CCATATTTCG GGACCAAAGT 1329 

TTTTTTTCTC CTACCTATGG GGACCAAAGT 1265 
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rrCCCTTTT TCCTTCTCCT GCTTTTCTTT 1335 
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"lTGTGCACG AGCTATCATT TCACTAAAAG 14 05 
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A4E TAGAAG C AATACTT TC A TCT 1695 

A4A AAATTAGAAG ClAATAAAlAAA AAAAAAAAAA AAAA 1619 
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A6E MTSQPI SNETI IM 

A6E TT GGC AAC ACC ATT ATG ACA TCA CAA CCT ATT TCC AAT GAG ACC ATC ATA ATG 53 

A6A TT GGC AAC ACC ATC ATG ACA TCA CAA CCT GTT CCC AAT GAG ACC ATC ATA GTG 277 

AGE LPSNVINFSQAE'KPEPTNQG 

AGE CTC CCA TCA AAT GTC ATC AAC TTC TCC CAA GCA GAG AAA CCC GAA CCC ACC AAC CAG GGG 113 

A6A CTC CCA TCA AAT GTC ATC AAC TTC TCC CAA GCA GAG AAA CCC GAA CCC ACC AAC CAG GGG 337 

AGE QDSLKKRLQAKVKVIG 

AGE CAG GAT AGC CTG AAG AAA CGT CTA CAG GCA AAA GTC AAA GTT ATT GGG 1 161 

A6A CAG GAT AGC CTG AAG AAA CAT CTA CAC GCA GAA ATT AAA GTT ATT GGG | (exons MS4AGA) 397 

A6E VHSSLAGSILSALSALVGFI 

AGE GTG CAT AGC AGC CTG GCT GGA AGC ATT CTG AGT GCT CTG TCT GCC CTG GTG GGT TTC ATT 221 

A6A GTG CAT AGC AGC CTG GTT GGA AGC ATT CTG AGT GCT CTG TCT GCC CTG GTG GGT TTC ATT 637 

A6E LLSVNPAALNPASLQCKLDE 

A6E CTC CTG TCT GTC AAC CCG GCT GCA TTA AAT CCT GCC TCA TTG CAG TGT AAG TTG GAC GAA 281 

A6A ATC CTG TCT GTC AAA CAG GCC ACC TTA AAT CCT GCC TCA CTG CAG TGT GAG TTG GAC AAA 697 

A6E KDIPTRLLLSYDYH SPYTM 

A6E AAG GAT ATA CCA ACC AGA CTT CTT CTT TCT TAT GAT TAT CAT TCA CCT TAC ACC ATG 338 

A6A AAT AAT ATA CCA ACA AGA AGT TAT GTT TCT TAC TTT TAT CAT GAT TCA CTT TAT ACC ACG 757 

AGE DCHRAKASLAGTLSL-MLVST 

AGE GAC TGC CAT AGA GCC AAA GCC AGT CTG GCT 1 GGA ACT CTG TCT CTG ATG CTG GTT TCT ACT 3 98 

AGA GAC TGC TAT ACA GCC AAA GCC AGT CTG GCT | GGA ACT CTC TCT CTG ATG CTG ATT TGC ACT 817 

AGE VLEFCLAV LTAVLQWKQTV* 

A6E GTG TTG GAG TTC TGC CTA GCT GTG ' CTC ACT GCT GTG CTG CAG TGG AAA CAG ACT GTC TGA 4 58 

AGA CTG CTG GAA TTC TGC CTA GCT GTG CTC ACT GCT GTG CTG CGG TGG AAA CAG GCT TAC TCT 877 

AGE C TTC CCT GG | TTA CTC ATG TCC 47 9 

AGA GAC TTC CCT GGG | AGT GTA CTT TTC CTG CCT CAC AGT TAC ATT GGT AAT TCT GGC ATG TCC 93 7 

AGE TCA AAG ACG ACT CAT GAT GCT GGA TAT GAA GAA CTA TTG ACT TCT TGG GAA AAA ACG GAG 53 9 

A6A TCA AAA ATG ACT CAT GAC TGT GGA TAT GAA GAA CTA TTG ACT TCT TAA GAA AAA AGG GAG 997 

AGE AAA TAT TAA TTG GAA AGT TGA TTT TTA TGA TAA TAT GGA AAA CCT AAC CAT TAT AAA AAA 59 9 

AGA AAA TAT TAA TCA GAA AGT TGA TTC TTA TGA TAA TAT GGA AAA GTT AAC CAT TAT AGA AAA 1057 

A6E GCA AA CTT GAG TTT CCT AAA TGT AAG CAT TTA AAG TAA ATG CAT ATT TGT TTT AAA AAA 65 8 

AGA GCA AAG CTT GAG TTT CCT AAA TGT AAG CTT TTA AAG TAA TG AAC ATT AA AAA AAA CCA 1115 

A6E TTA 

A6A TTA TTT CAC TGT CA 
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Figure 5 



AlO 
alO 



C CAG GGC CCC CAT CCA GCA TCA 2 2 
C CAG GGT TCC CGG CCA GGG TCA 7 4 



AlO MKAE ATVIPSRCARGLPSW 

AlO ATG AAA GCA GAA GCC ACA GTT ATT CCC AGC CGT TGT GCT AGG GGG CTC CCA TCA TGG 7 9 

alO ATG GCT GGC CAA GCT CCC ACA GCG GTT CCC GGC AGT GTT ACT GGA GAA GTC TCA CGA TGG 134 

AlO QVLSPVQPWQTSAPQNTTQP 

AlO CAA GTC CTC AGC CCA GTC CAG CCC TGG CAG ACA AGT GCA CCC CAG AAC ACG ACC CAG CCC 13 9 

alO CAG AAC CTA GGA CCT GCC CAG CCT GCA CAG AAA GTA GCC CAG CCC 17 9 

AlO KLLAPHQH EKSQKKSSLLK 

AlO AAG CTC CTG GCT CCA CAC CAG CAC GAG AAG TCC CAG AAG AAG AGC AGC CTT CTT AAG 196 

alO CAA AAC CTG GTT CCA GAT GGG CAC CTT GAG AAA GCC CTG GAG GGA AGT GAC CTT CTA CAG 239 

AlO ELGAFHITIALLHLVFGGYL 

AlO GAG CTG GGG I GCC TTC CAC ATC ACC ATC GCT CTG CTG CAC CTG GTC TTT GGG GGC TAC CTG 256 

alO AAG TTG GGT GGC TTT CAC ATC GCC ATT GCT TTT GCA CAC TTG GCC TTT GGT GGC TAC TTG 299 

AlO ASIVKNLHLVVLKSWYPFWG 

AlO GCC TCT ATA GTC AAG AAC CTT CAC CTG GTG GTG CTG AAG TCT TGG TAT CCA TTC TGG GGG 316 

alO ATC TCC ACA GTC AAG AAT CTG CAC CTG GTG GTG TTG AAG TGC TGG TAT CCA CTC TGG GGA 359 

AlO AASFLI SGILAITMK'TFSKT 

AlO GCT GCC TCT | TTT CTC ATT TCA GGG ATC TTG GCG ATA ACA ATG AAG ACC TTT TCT AAA ACT 37 6 

a 10 ACT GTC TCT TTT CTC GTT GCA GGG ATG GCG GCC ATG ACA ACA GTG ACA TTC CCT AAG ACC 419 

AlO YLKMLCLMTNLISLFCVLSG 

AlO TAC CTG 1 AAG ATG TTG TGC CTG ATG ACA AAC CTC ATC AGC CTC TTT TGC GTG CTG TCT GGC 43 6 

alO TCT CTG AAG GTT CTC TGT GTG ATA GCC AAC GTT ATC AGC TTG TTC TGC GCA CTG GCC GGC 479 

AlO LFVI SKDLFLESPFESPIWR 

AlO CTC TTC GTC ATC TCC AAG GAT CTC TTT CTG GAG AGC CCA TTT GAG TCC CCG ATC TGG AGA 496 

alO TTC TTT GTC ATT GCC AAG GAC CTC TTC CTG GAG GGT CCT TTT CCA TGG CCA ATC TGG AGA 53 9 

AlO MYPNSTVHIQRLELALLCFT 

AlO ATG TAC CCC AAC TCC ACG | GTC CAC ATC CAG AGG CTG GAG CTG GCC TTG CTC TGC TTC ACT 556 

alO CCA TAC CCT GAA CCC ACA ACC TAC ATC CAA AGG CTA GAG CTG ACC CTG TTC TGC TTC ACC 599 

AlO VLELFLPVPTAVTAWRGDCP 

AlO GTC CTA GAG CTC TTC CTG CCA GTG CCC ACA GCT GTC ACA GCC TGG AGA GGG GAC TGC CCA 616 

alO TTC CTG GAG ATC TTC CTG TCG GGG TCC ACA GCC ATC ACA GCC TAT AGG ATG AAA CGC CTG 659 

AlO SAK NDDACLVPNTPLHLKG 

AlO TCT GCA AAG |AAT GAT GAT GCA TGC CTT GTT CCG AAT ACA CCA TTG CAT CTC AAA GGC 67 3 

alO CAA GCA GAG GAC AAG GAT GAC ACC CCT TTC GTT CCT GAC ACA CCT ATG GAA CTT AAA GGT 719 

AlO LPVEPPPSYQSVIQGDAQHK 

AlO CTG CCG GTG GAG CCC CCG CCA TCC TAC CAG AGT GTG ATT CAA GGC GAC GCA CAA CAC AAG 73 3 

alO CTA TCG CTG GGG CCA CCA CCA TCC TAT AAA GAT GTG GCC CAA GGA CAC TCC TCC TCT GAT 77 9 

AlO Q H Q R * 

AlO CAA CAT CAG AGG TGA AGA GGT TTG GCC CTG GCT CCC CAG ACC TTC AGA ACC CAC GCT TGG 793 

alO ACT GGC AGA GCC TTG GCC ACC AGC TCT GGA CTG CTG CTG GCC TCT GAC TCC TTC CAT CAA 83 9 



AlO CCC CTT CTC CCT GGC ATA GAT AGA AAC CAG TTC CCA GGG GAGCTGTAAG GGGCAAGGGG 

alO GCT CTG CTT CAC ACG GGC CCA AGG ACC TTG AGA AAG (TAG TCTAAGACTA CTAGACTTTC 

AlO CTAACAGTGC TGC AG CT AG A ACTGCCTGGC CTGGGAGTGG TAGCCAAGGC AGCAATGGTC CTATCCCCTC 

alO CC (A A 'I 'A a"a] G A CTGAGTTCCT GCAATGCCAA GTCCTTCCCT GCCTCCCATT GGCTGGACTC CGG TGA TGA C 

AlO AGGCACTAGA GGAACTC CCA GGGCC CCCCA GG AAC ATCTC AGCACCTGAC 

alO ACAGGCGTGA CTTGCACA jAA TAAAfr GCATA ACCACTATGA AAAAAAAAAA 
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Figure 7 



Al MTTPRNSVNGTFPAEPMKGPI AMQSGPKPLFRRMSSLVGPTQSFFMRESKTLG 

al MSGPFPAEPTKGPLAMQPAPKVNLKRTSSLVGPTQSFFMRESKALG 

A2 MDTESNRRANLALP QEPS | SVPAFEVLEI SPQEVSSGRLLKSASSPPLHTWLTVLKKEQEFLG 

a2 MDTENRSRADLALPNPQESS SAPD I ELLEASP AKAAPPKQTWRTFLKKELEFLG 

A3 MASHEVDNAELGSASAHGTPGSETGPEELNTSVYHP I NGS PDYQKAKLQVLG 

a3 MKPEETGGSVYQPLDESRHVQRGVLQALG 

A4A MTTMQGMEQAMPGAGPGVPQLGNMAVIHSHLWKGLQEKFLKGEPKVLG 

a4B MQGQEQTTMAWPGVAVPSKNSVMTSQMWNEKKEKFLKGEPKVLG 

a4C MQGQEQTTMAWPGGAPPSENSVMKSQMWNENKEKFLKGEPKVLG 

a4D MQGLAQTTMAWPGGAPPSENSVI KSQMWNKNKEKFLKGEPKVLG 

A4E MTTMCGMEQTTPGPGPDVPQLGNIDVIHSYLCKGLQEKFFfeKPKVLG 

A5 MDSSTAHSPVFLVFPPEITASEYESTELSATTFSTQSPLQKLFARKMKILG 

A6A MTSQPVPNET 1 1 VLPSNVI NFSQAEKPEPTNQGQDSLKKHLHAEI KVI G 

a6B MI PQWTSETVAMISPNGMSLPQTDKPQPFHQWQDSLKi^tKAEIKVMA 

a 6 C MI PQWTNET ITT IS PNG I N FPQKD ESQPTQQRQDS LKKHLKAE I KV I V 

a6D MI PQWT S ETVTV I S PNG I S FPQTDKPQPSHQ SQD SL KKHLKAE I KVMA 

A6 E MTSQP I SNET 1 1 MLPSNVI NFSQAEKPEPTNQGQDSLKKRLQAKVKV I G 

A7 MLLQSQTMGVSHSFTPKGI TI PQREKPGHMYQNEDYLQNGLPTETTVLG 

a7 MRLQLGTKNIGWDCFPKDII IHKREKTGHTYEKEDDLLIGVPSEATLLG 

A8B MNSMTSAVPVANSVLWAPHNGYPVTPGIMSHVPLYPNSQPQVHLVPGNPPSLVSNVNGQPVQKALKEGKTLG 
a8B MEPEQERLTWQPGWSNINTVTSPGPMANSVYWAPPNSYPWPGTVPQMPIYPSNQ 

AlO MKAEA TVIPSRCARGLPSWQVLSPVQPWQTSAPQNTTQPKLLAPHQH EKSQKKSSLLKELG 

alO MAGQAPTAVPGSVTGEVSRWQNLG PAQPAQKVA QPQNLVPDGHLEKALEGSDLLQKLG 
Al 2 MMSSKPTSHAEVNETI PNPYPPGSFMAPGFQQPLGSINLENQAQGAQRAQPYGI TSPGI FASSQPGQGNIQMI NPSVGTAVMNFKEEAKALG 
al2 MMSSKPTTYPGVYGTTPDLYPPSNFMVPGSQQPPGFINPRIQVQSSQ APFI VSPGI PNNSQQVQGNIQMVNPGTGKAATNFKEEAKTLG 
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VKGKM I MNSLSLFAAI SGMI LS I MD I LN I KI SHFLKMESLNF I RAHTP Y I NI YNCEPANPSEKNSPSTQY 
VKAKVIMSSLSLFAAISGI ILSIMDILNMTLSHFLKMRRLELIQTSKPYVDI YDCEPSNSSEKNSPSTQY 
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CYSIQSLFL | GILSVMLIFAFFQELVIAGIVENEWKRTCSRPKS | 
CNS I QSVFL G I LSAML I SAFFQKLVTAGI VENEWKRMCTRSKS 
EIWMMLFLTILGLGSAVSLTICGAGEELKGNKj 
ELVLMMLFLTI LAFCSAVLFTI YRIGQELESKK 
GMVSLLLI LTLLELCVTI ST I AMWCNANCCNSRE 
GLDGMVLLLSVLEFCI AVSLSAFGCKVLCCTPGG 
GLDVLMLI FNMLEFCLAVSVSAFGCEASCCNSRE 
GLDI LML I LNMLEFC I AVS I SAFGCKASCCNSSE 
S LDVLMT I LNMLE FC I AVSVSAFGC KASCCNS SE 
GTDGMVLLLSVLEFCI AVSLSAFGCKVLCCSPSE | 
GI LI TLMTFS I I ELFI SLPFSI LGCHSEDCDCEQCC 



NI VLLSAEEKKEQTIEIKEEWGLTETSSQPKNEEDI EI I PIQEEEEEETETNFPEPPQDQESSPIENDSSP297 
NWLLSAGEKNEQTIKMKEEI IELSGVSSQPKNEEEIEI IPVQEEEEEEAEINFPAPPQEQESLPVENEI AP2 91 
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GTLSLMLICTLLEFCLAVLTAVLRWKQAYSDFPG j SVLFLPHSYIGNSGMSSKMTHDCGYEELLTS 
GVFSLMLI STMLELGLAVLTAMLWWKQSHSNI PG NVMFLPHSSNNDSNMESKVLCNPSYEEQLVC 
GALSVMLI I SVLELGLALLSAMLWLREGVLTSLRM 

GVFSLMLISSVLELGLAVLTATLWWKQSSSAFSG NVI FLSQNSKNKSSVSSESLCNPTYENI LTS 
GTLS LMLVSTVLE FCLAVLTAVLQWKQTV 

GVLWMLI FTVLELLLAAYSSVFWWKQLYSNNPG j SSFSSTQSQDHIQCVKKSSSRSWI 
SALGMMLLFTVLEVFLAGYSSI FWWKQVYSNKPG GTFFLPQSQDHTQLVKSNLLQ 
AWGV| NPGMAISGVLLVFCLLEFGIACASSHFGCQLVCCQSSNj VSVIYP NI YAANPVI TPE PVTSPPSYSSEIQANK 
ENLGV RTGVAI SSVLLI FCLLELSI ASVSSHFGCQVACCHYNN PGWI P | NVYAANPWI PEPPNPI PSYSEWQDSR 

| VHIQRLELALLCFTVLELFLPVPTAVTAWRGDCPSAK | NDDACLVPNTPLHLKGLPVEPPPSYQSVIQGDAQHKQHQR 
TYIQRLELTLFCFTFLEI FLSGSTAITAYRMKRLQAEDKDDTPFVPDTPMELKGLSLGPPPSYKDVACCHSSSDTGRALATSSGLLLASDSFHQALLHTGPRTLRK 
DYWAV | LSGKGI SATLMI FSLLEFFVACATAHFANQANTTTNM j SVLVI PNMYESNPVTPASSSAPPRCNNYSANAPK 
DYWAV LSGKGI SAMLI I FSLLEFCI TCVTAYFASHTITNTRGL SWSFHLCMQTVP 
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<110> Tedder, Thomas 
Liang, Yinghua 

<120> Identification of Novel MS4A Gene Family Members Expressed by 
Hematopoietic Cells 

<130> 180/132 

<160> 81 

<170> Patentln version 3.0 

<210> 1 

<211> 1619 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> cDNA 

<222> (1) . . (1619) 

<220> 

<221> CDS 

<222> (144) . . (806) 

<400> 1 

aacattcctg caaatggttt caatatatgc agatgtctcg atataggaat gaaattacgt 60 

ctttggaaca acttaaataa gtcaaatata cttggagctt taaaaattaa aaggagagag 120 

attcgagcac cttttctgct gcc atg aca acc atg caa gga atg gaa cag gcc 173 

Met Thr Thr Met Gin Gly Met Glu Gin Ala 
15 10 
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atg cca ggg get ggc cct ggt gtg ccc cag ctg gga aac atg get gtc 221 
Met Pro Gly Ala Gly Pro Gly Val Pro Gin Leu Gly Asn Met Ala Val 

15 20 25 

ata cat tea cat ctg tgg aaa gga ttg caa gag aag ttc ttg aag gga 269 
lie His Ser His Leu Trp Lys Gly Leu Gin Glu Lys Phe Leu Lys Gly 

30 35 40 

gaa ccc aaa gtc ctt ggg gtt gtg cag att ctg act gee ctg atg age 317 
Glu Pro Lys Val Leu Gly Val Val Gin lie Leu Thr Ala Leu Met Ser 
45 50 55 

ctt age atg gga ata aca atg atg tgt atg gca tct aat act tat gga 365 
Leu Ser Met Gly lie Thr Met Met Cys Met Ala Ser Asn Thr Tyr Gly 
60 65 70 

agt aac cct att tec gtg tat ate ggg tac aca att tgg ggg tea gta 413 
Ser Asn Pro lie Ser Val Tyr lie Gly Tyr Thr lie Trp Gly Ser Val 
75 80 85 90 

atg ttt att att tea gga tec ttg tea att gca gca gga att aga act 461 
Met Phe lie lie Ser Gly Ser Leu Ser lie Ala Ala Gly lie Arg Thr 

95 100 105 

aca aaa ggc ctg gtc cga ggt agt eta gga atg aat ate ace age tct 509 
Thr Lys Gly Leu Val Arg Gly Ser Leu Gly Met Asn lie Thr Ser Ser 

110 115 120 

gta ctg get gca tea ggg ate tta ate aac aca ttt age ttg gcg ttt 557 
Val Leu Ala Ala Ser Gly lie Leu lie Asn Thr Phe Ser Leu Ala Phe 
125 130 135 

tat tea ttc cat cac cct tac tgt aac tac tat ggc aac tea aat aat 605 
Tyr Ser Phe His His Pro Tyr Cys Asn Tyr Tyr Gly Asn Ser Asn Asn 
140 145 150 

tgt cat ggg act atg tec ate tta atg ggt ctg gat ggc atg gtg etc 653 
Cys His Gly Thr Met Ser lie Leu Met Gly Leu Asp Gly Met Val Leu 
155 160 • 165 170 

etc tta agt gtg ctg gaa ttc tgc att get gtg tec etc tct gee ttt 701 
Leu Leu Ser Val Leu Glu Phe Cys lie Ala Val Ser Leu Ser Ala Phe 

175 180 185 

gga tgt aaa gtg etc tgt tgt ace cct ggt ggg gtt gtg tta att ctg 749 
Gly Cys Lys Val Leu Cys Cys Thr Pro Gly Gly Val Val Leu lie Leu 

190 195 200 

cca tea cat tct cac atg gca gaa aca gca tct ccc aca cca ctt aat 797 
Pro Ser His Ser His Met Ala Glu Thr Ala Ser Pro Thr Pro Leu Asn 
205 210 215 

gag gtt tga ggccaccaaa agatcaacag acaaatgetc cagaaatcta 846 
Glu Val 
220 

tgctgactgt gacacaagag cctcacatga gaaattacca gtatccaact tcgatactga 906 



tagacttgtt gatattatta ttatatgtaa tccaattatg aactgtgtgt gtatagagag 966 
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ataataaatt 


caaaattatg 


ttctcatttt 


tttccctgga 


actcaataac 


tcatttcact 


ggctctttat 


cgagagtact 


agaagttaaa 


ttaataaata 


atgcatttaa 


tgaggcaaca 


gcacttgaaa 


gtttttcatt 


catcataaga 


actttatata 


aaggcattac 


attggcaaat 


aaggtttgga 


agcagaagag 


caaaaaaaag 


atattgttaa 


aatgaggcct 


ccatgcaaaa 


cacatacttc 


cctcccattt 


atttaacttt 


ttttttctcc 


tacctatggg 


gaccaaagtg 


ctttttcctt 


caggaagtgg 


agatgcatgg 


ccatctcccc 


ctcccttttt 


ccttctcctg 


cttttctttc 


cccatagaaa 


gtaccttgaa 


gtagcacagt 


ccgtccttgc 


atgtgcacga 


gctatcattt 


gagtaaaagt 


atacatggag 


taaaaatcat 


attaagcatc 


agattcaact 


tatattttct 


atttcatctt 


cttcctttcc 


cttctcccac 


cttctactgg 


gcataattat 


atcttaatca 


tatatggaaa 


tgtgcaacat 


atggtatttg 


ttaaatacgt 


ttgtttttat 


tgcagagcaa 


aaataaatca 


aattagaagc 


aataaaaaaa 


aaaaaaaaaa 


aaa 



<210> 2 

<211> 220 

<212> PRT 

<213> Homo sapiens 



<400> 2 

Met Thr Thr Met Gin Gly Met Glu Gin Ala Met Pro Gly Ala Gly Pro 
15 10 15 



Gly Val Pro Gin Leu Gly Asn Met Ala Val lie His Ser His Leu Trp 

20 25 30 



Lys Gly Leu Gin Glu Lys Phe Leu Lys Gly Glu Pro Lys Val Leu Gly 
35 40 45 



Val Val Gin lie Leu Thr Ala Leu Met Ser Leu Ser Met Gly lie Thr 
50 55 60 



Met Met Cys Met Ala Ser Asn Thr Tyr Gly Ser Asn Pro lie Ser Val 
65 70 75 80 



Tyr lie Gly Tyr Thr lie Trp Gly Ser Val Met Phe lie lie Ser Gly 

85 90 95 



1026 
1086 
1146 
1206 
1266 
1326 
1386 
1446 
1506 
1566 
1619 



Ser Leu Ser lie Ala Ala Gly lie Arg Thr Thr Lys Gly Leu Val Arg 

100 105 110 
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Gly Ser Leu Gly Met Asn lie Thr Ser Ser Val Leu Ala Ala Ser Gly 
115 120 125 



lie Leu lie Asn Thr Phe Ser Leu Ala Phe Tyr Ser Phe His His Pro 
130 135 140 



Tyr Cys Asn Tyr Tyr Gly Asn Ser Asn Asn Cys His Gly Thr Met Ser 
145 150 155 160 



lie Leu Met Gly Leu Asp Gly Met Val Leu Leu Leu Ser Val Leu Glu 

165 170 175 



Phe Cys lie Ala Val Ser Leu Ser Ala Phe Gly Cys Lys Val Leu Cys 

180 185 190 



Cys Thr Pro Gly Gly Val Val Leu lie Leu Pro Ser His Ser His Met 
195 200 205 



Ala Glu Thr Ala Ser Pro Thr Pro Leu Asn Glu Val 
210 215 220 



<210> 3 

<211> 663 

<212> DNA 

<213 > Homo sapiens 



<220> 

<221> CDS 

<222> (1) . . (663) 



<400> 3 

atg aca acc atg caa gga atg gaa cag acc act cca ggg cct ggc cct 48 
Met Thr Thr Met Gin Gly Met Glu Gin Thr Thr Pro Gly Pro Gly Pro 
15 10 15 

gat gtg ccc cag ctg gga aac ata gat gtc ata cat tea tat ctg tgt 96 
Asp Val Pro Gin Leu Gly Asn lie Asp Val lie His Ser Tyr Leu Cys 

20 25 30 



aaa gga ttg caa gag aag ttc ttc aag agg aaa ccc aaa gtc ctt ggg 
Lys Gly Leu Gin Glu Lys Phe Phe Lys Arg Lys Pro Lys Val Leu Gly 
35 40 45 



144 
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gtt gtg egg att ctg att gec ttg atg age ctt age atg gga ata ata 192 
Val Val Arg lie Leu lie Ala Leu Met Ser Leu Ser Met Gly lie lie 
50 55 60 

atg atg tgt gtt gca ttt agt tct tat gaa gaa cat ccc att ttt gtg 240 
Met Met Cys Val Ala Phe Ser Ser Tyr Glu Glu His Pro lie Phe Val 
65 70 75 80 

tat gtt gcg tac aca att tgg ggg tea gtg atg ttt att att tea gta 288 
Tyr Val Ala Tyr Thr lie Trp Gly Ser Val Met Phe lie lie Ser Val 

85 90 95 

tec tta tea gtt gca gca gga att aga aca aca aaa ggt ctg gtt gga 336 
Ser Leu Ser Val Ala Ala Gly lie Arg Thr Thr Lys Gly Leu Val Gly 

100 105 110 

ggt agt eta gga aag aat ate acc agt tea gtc ttg get ata tea ggg 3 84 

Gly Ser Leu Gly Lys Asn lie Thr Ser Ser Val Leu Ala lie Ser Gly 
115 120 125 

ate tta ate aat gca ata age ttg acg ttt tat tea ttc cgt tac cat 432 
lie Leu lie Asn Ala He Ser Leu Thr Phe Tyr Ser Phe Arg Tyr His 
130 135 140 

tac tgt aac cac gat cag ttg tea agt aat tgt tac atg act atg tec 480 
Tyr Cys Asn His Asp Gin Leu Ser Ser Asn Cys Tyr Met Thr Met Ser 
145 150 155 160 

att tta atg ggt acg gat ggc atg gtg etc etc tta agt gtg ctg gaa 528 
He Leu Met Gly Thr Asp Gly Met Val Leu Leu Leu Ser Val Leu Glu 

165 170 175 

ttc tgc att get gtg tec etc tct gee ttt gga tgt aaa gtg ctt tgt 576 
Phe Cys lie Ala Val Ser Leu Ser Ala Phe Gly Cys Lys Val Leu Cys 

180 185 190 

tgt age ccc agt gag ttt gtg eta att aca cca tea aat tct cac atg 624 
Cys Ser Pro Ser Glu Phe Val Leu He Thr Pro Ser Asn Ser His Met 
195 200 205 

gca gaa ata gca tct ccc aca cca ctt aag aca gtt tga 663 
Ala Glu He Ala Ser Pro Thr Pro Leu Lys Thr Val 
210 215 220 



<210> 4 

<211> 220 

<212> PRT 

<213> Homo sapiens 



<400> 4 



Met Thr Thr Met Gin Gly Met Glu Gin Thr Thr Pro Gly Pro Gly Pro 
15 10 15 
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Asp Val Pro Gin Leu Gly Asn lie Asp Val lie His Ser Tyr Leu Cys 

20 25 30 



Lys Gly Leu Gin Glu Lys Phe Phe Lys Arg Lys Pro Lys Val Leu Gly 
35 40 45 



Val Val Arg lie Leu lie Ala Leu Met Ser Leu Ser Met Gly lie lie 
50 55 60 



Met Met Cys Val Ala Phe Ser Ser Tyr Glu Glu His Pro lie Phe Val 
65 70 75 80 



Tyr Val Ala Tyr Thr lie Trp Gly Ser Val Met Phe lie lie Ser Val 

85 90 95 



Ser Leu Ser Val Ala Ala Gly lie Arg Thr Thr Lys Gly Leu Val Gly 

100 105 110 



Gly Ser Leu Gly Lys Asn lie Thr Ser Ser Val Leu Ala lie Ser Gly 
115 120 125 



lie Leu lie Asn Ala lie Ser Leu Thr Phe Tyr Ser Phe Arg Tyr His 
130 135 140 



Tyr Cys Asn His Asp Gin Leu Ser Ser Asn Cys Tyr Met Thr Met Ser 
145 150 155 160 



lie Leu Met Gly Thr Asp Gly Met Val Leu Leu Leu Ser Val Leu Glu 

165 170 175 



Phe Cys lie Ala Val Ser Leu Ser Ala Phe Gly Cys Lys Val Leu Cys 

180 185 190 



Cys Ser Pro Ser Glu Phe Val Leu lie Thr Pro Ser Asn Ser His Met 
195 200 205 



Ala Glu lie Ala Ser Pro Thr Pro Leu Lys Thr Val 
210 215 220 



<210> 5 
<211> 694 
<212> DNA 



<213> Homo sapiens 
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<220> 

<221> cDNA 
<222> (1) . . (694) 



<220> 

<221> CDS 

<222> (55) . . (657) 



<400> 5 

ctagactgaa gtaccaacta aatcatctcc tttcaaatta tcaccgacac catc atg 57 

Met 
1 

gat tea age ace gca cac agt ccg gtg ttt ctg gta ttt cct cca gaa 105 
Asp Ser Ser Thr Ala His Ser Pro Val Phe Leu Val Phe Pro Pro Glu 

5 10 15 

ate act get tea gaa tat gag tec aca gaa ctt tea gee acg ace ttt 153 
lie Thr Ala Ser Glu Tyr Glu Ser Thr Glu Leu Ser Ala Thr Thr Phe 
20 25 30 

tea act caa age ccc ttg caa aaa tta ttt get aga aaa atg aaa ate 201 
Ser Thr Gin Ser Pro Leu Gin Lys Leu Phe Ala Arg Lys Met Lys lie 
35 40 45 

tta ggg act ate cag ate ctg ttt gga att atg ace ttt tct ttt gga 249 
Leu Gly Thr lie Gin lie Leu Phe Gly lie Met Thr Phe Ser Phe Gly 
50 55 60 65 

gtt ate ttc ctt ttc act ttg tta aaa cca tat cca agg ttt ccc ttt 297 
Val lie Phe Leu Phe Thr Leu Leu Lys Pro Tyr Pro Arg Phe Pro Phe 

70 75 80 

ata ttt ctt tea gga tat cca ttc tgg ggc tct gtt ttg ttc att aat 345 
lie Phe Leu Ser Gly Tyr Pro Phe Trp Gly Ser Val Leu Phe lie Asn 

85 90 95 

tct gga gee ttc eta att gca gtg aaa aga aaa ace aca gaa act ctg 393 
Ser Gly Ala Phe Leu lie Ala Val Lys Arg Lys Thr Thr Glu Thr Leu 
100 105 110 

ata ata ttg age cga ata atg aat ttt ctt agt gee ctg gga gca ata 441 
lie lie Leu Ser Arg lie Met Asn Phe Leu Ser Ala Leu Gly Ala lie 
115 120 125 

get gga ate att etc etc aca ttt ggt ttc ate eta gat caa aac tac 489 
Ala Gly lie lie Leu Leu Thr Phe Gly Phe lie Leu Asp Gin Asn Tyr 
130 135 140 145 



att tgt ggt tat tct cac caa aat agt cag tgt aag get gtt act gtc 
lie Cys Gly Tyr Ser His Gin Asn Ser Gin Cys Lys Ala Val Thr Val 



537 
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150 155 160 

ctg ttc ttg gga att ttg att aca ttg atg act ttc age att att gaa 585 
Leu Phe Leu Gly lie Leu lie Thr Leu Met Thr Phe Ser lie lie Glu 

165 170 175 

tta ttc att tct ctg cct ttc tea att ttg ggg tgc cac tea gag gat 633 
Leu Phe lie Ser Leu Pro Phe Ser lie Leu Gly Cys His Ser Glu Asp 
180 185 190 

tgt gat tgt gaa caa tgt tgt tga ctagcactgt gagaataaag atgtgttaaa 687 
Cys Asp Cys Glu Gin Cys Cys 
195 200 



ataaaaa 

<210> 6 

<211> 200 

<212> PRT 

<213> Homo sapiens 



<400> 6 

Met Asp Ser Ser Thr Ala His Ser Pro Val Phe Leu Val Phe Pro Pro 
15 10 15 



Glu lie Thr Ala Ser Glu Tyr Glu Ser Thr Glu Leu Ser Ala Thr Thr 

20 25 30 



Phe Ser Thr Gin Ser Pro Leu Gin Lys Leu Phe Ala Arg Lys Met Lys 
35 40 45 



lie Leu Gly Thr lie Gin lie Leu Phe Gly lie Met Thr Phe Ser Phe 
50 55 60 



Gly Val lie Phe Leu Phe Thr Leu Leu Lys Pro Tyr Pro Arg Phe Pro 
65 70 75 80 



Phe lie Phe Leu Ser Gly Tyr Pro Phe Trp Gly Ser Val Leu Phe lie 

85 90 95 



Asn Ser Gly Ala Phe Leu lie Ala Val Lys Arg Lys Thr Thr Glu Thr 

100 105 110 



694 



Leu lie lie Leu Ser Arg lie Met Asn Phe Leu Ser Ala Leu Gly Ala 
115 120 125 
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lie Ala Gly lie lie Leu Leu Thr Phe Gly Phe lie Leu Asp Gin Asn 
130 135 140 



Tyr lie Cys Gly Tyr Ser His Gin Asn Ser Gin Cys Lys Ala Val Thr 
145 150 155 160 



Val Leu Phe Leu Gly lie Leu lie Thr Leu Met Thr Phe Ser lie He 

165 170 175 



Glu Leu Phe He Ser Leu Pro Phe Ser He Leu Gly Cys His Ser Glu 

180 185 190 



Asp Cys Asp Cys Glu Gin Cys Cys 
195 200 



<210> 7 

<211> 1152 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> cDNA 

<222> (1) . . (1152) 

<223> n is an undetermined nucleotide (dATP , dCTP, dGTP , or dTTP) 
<220> 

<221> CDS 

<222> (239) . . (985) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<400> 7 

ttttactgac cttgctagaa gtttacagnc acggaagtgc aggaacattt cacaaatcta 60 

caatctgtga gtatcacatc ctgtatagct gtaaacactg gaataaggaa gggctgatga 120 

ctttcagaag atgaaggtaa gtagaaaccg ttgatgggac tgagaaacca gagttaaaac 180 

ctctttggag cttctgagga ctcagctgga accaacgggc acagttggca acaccatc 23 8 

atg aca tea caa cct gtt ccc aat gag acc ate ata gtg etc cca tea 286 
Met Thr Ser Gin Pro Val Pro Asn Glu Thr He He Val Leu Pro Ser 
15 10 15 

aat gtc ate aac ttc tec caa gca gag aaa ccc gaa ccc acc aac cag 334 
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Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 

20 25 30 

ggg cag gat age ctg aag aaa cat eta cac gca gaa att aaa gtt att 3 82 

Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu lie Lys Val lie 
35 40 45 

ggg act ate cag ate ttg tgt ggc atg atg gta ttg age ttg ggg ate 430 
Gly Thr lie Gin lie Leu Cys Gly Met Met Val Leu Ser Leu Gly lie 
50 55 60 

att ttg gca tct get tec ttc tct cca aat ttt ace caa gtg act tct 478 
lie Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gin Val Thr Ser 
65 70 75 80 

aca ctg ttg aac tct get tac cca ttc ata gga ccc ttt ttt ttt ate 526 
Thr Leu Leu Asn Ser Ala Tyr Pro Phe lie Gly Pro Phe Phe Phe lie 

85 90 95 

ate tct ggc tct eta tea ate gee aca gag aaa agg tta ace aag ctt 574 
lie Ser Gly Ser Leu Ser lie Ala Thr Glu Lys Arg Leu Thr Lys Leu 

100 105 110 

ttg gtg cat age age ctg gtt gga age att ctg agt get ctg tct gee 622 
Leu Val His Ser Ser Leu Val Gly Ser lie Leu Ser Ala Leu Ser Ala 
115 120 125 

ctg gtg ggt ttc att ate ctg tct gtc aaa cag gee ace tta aat cct 670 
Leu Val Gly Phe lie lie Leu Ser Val Lys Gin Ala Thr Leu Asn Pro 
130 135 140 

gee tea ctg cag tgt gag ttg gac aaa aat aat ata cca aca aga agt 718 
Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Asn lie Pro Thr Arg Ser 
145 150 155 160 

tat gtt tct tac ttt tat cat gat tea ctt tat ace acg gac tgc tat 766 
Tyr Val Ser Tyr Phe Tyr His Asp Ser Leu Tyr Thr Thr Asp Cys Tyr 

165 170 175 

aca gee aaa gee agt ctg get gga act etc tct ctg atg ctg att tgc 814 
Thr Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met Leu lie Cys 

180 185 190 

act ctg ctg gaa ttc tgc eta get gtg etc act get gtg ctg egg tgg 862 
Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Arg Trp 
195 200 205 

aaa cag get tac tct gac ttc cct ggg agt gta ctt ttc ctg cct cac 910 
Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu Phe Leu Pro His 
210 215 220 

agt tac att ggt aat tct ggc atg tec tea aaa atg act cat gac tgt 958 
Ser Tyr lie Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys 
225 230 235 " 240 

gga tat gaa gaa eta ttg act tct taa gaaaaaaggg agaaatatta 1005 
Gly Tyr Glu Glu Leu Leu Thr Ser 

245 

atcagaaagt tgattcttat gataatatgg aaaagttaac cattatagaa aagcaaagct 1065 
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tgagtttcct aaatgtaagc ttttaaagta atgaacatta aaaaaaacca ttatttcact 1125 
gtcaaaaaaa aaaaaaaaaa aaaaaaa 1152 

<210> 8 

<211> 248 

<212> PRT 

<213> Homo sapiens 



<400> 8 

Met Thr Ser Gin Pro Val Pro Asn Glu Thr lie lie Val Leu Pro Ser 
15 10 15 



Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 

20 25 30 



Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu lie Lys Val lie 
35 40 45 



Gly Thr lie Gin lie Leu Cys Gly Met Met Val Leu Ser Leu Gly lie 
50 55 60 



lie Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gin Val Thr Ser 
65 70 75 80 



Thr Leu Leu Asn Ser Ala Tyr Pro Phe lie Gly Pro Phe Phe Phe lie 

85 90 95 



lie Ser Gly Ser Leu Ser lie Ala Thr Glu Lys Arg Leu Thr Lys Leu 

100 105 110 



Leu Val His Ser Ser Leu Val Gly Ser lie Leu Ser Ala Leu Ser Ala 
115 120 125 



Leu Val Gly Phe lie lie Leu Ser Val Lys Gin Ala Thr Leu Asn Pro 
130 135 140 



Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Asn lie Pro Thr Arg Ser 
145 150 155 160 



Tyr Val Ser Tyr Phe Tyr His Asp Ser Leu Tyr Thr Thr Asp Cys Tyr 

165 170 175 
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Thr Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met Leu lie Cys 

180 185 190 



Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Arg Trp 
195 200 205 



Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu Phe Leu Pro His 
210 215 220 



Ser Tyr lie Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys 
225 230 235 240 



Gly Tyr Glu Glu Leu Leu Thr Ser 

245 



<210> 9 

<211> 930 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDNA 
<222> (1) . . (930) 



<220> 

<221> CDS 

<222> (17) . . (763) 



<400> 9 

agttggcaac accatc atg aca tea caa cct gtt ccc aat gag acc ate ata 52 

Met Thr Ser Gin Pro Val Pro Asn Glu Thr lie lie 

15 10 

gtg etc cca tea aat gtc ate aac ttc tec caa gca gag aaa ccc gaa 100 
Val Leu Pro Ser Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu 
15 20 25 

ccc acc aac cag ggg cag gat age ctg aag aaa cat eta cac gca gaa 148 
Pro Thr Asn Gin Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu 
30 35 40 

ate aaa gtt att ggg act ate cag ate ttg tgt ggc atg atg gta ttg 196 
lie Lys Val lie Gly Thr lie Gin lie Leu Cys Gly Met Met Val Leu 



-12- 



WO 02/062946 



PCT/US01/48437 



45 

age ttg ggg 
Ser Leu Gly 



caa gtg act 
Gin Val Thr 



ttt ttt ttt 
Phe Phe Phe 
95 

tta acc aag 
Leu Thr Lys 
110 

get ctg tct 
Ala Leu Ser 
125 

acc tta aat 
Thr Leu Asn 



cca aca aga 
Pro Thr Arg 



50 

ate att ttg 
lie lie Leu 
65 

tct aca ctg 
Ser Thr Leu 
80 

ate ate tct 
lie lie Ser 



ctt ttg gtg 
Leu Leu Val 



gee ctg gtg 
Ala Leu Val 
130 

cct gee tea 
Pro Ala Ser 
145 

agt tat gtt 
Ser Tyr Val 
160 



gca tct get 
Ala Ser Ala 



ttg aac tct 
Leu Asn Ser 
85 

ggc tct eta 
Gly Ser Leu 
100 

cat age age 
His Ser Ser 
115 

ggt ttc att 
Gly Phe lie 



ctg cag tgt 
Leu Gin Cys 



tct tac ttt 
Ser Tyr Phe 
165 



55 

tec ttc tct 
Ser Phe Ser 
70 

get tac cca 
Ala Tyr Pro 



tea ate gee 
Ser lie Ala 



ctg gtt gga 
Leu Val Gly 
120 

ate ctg tct 
lie Leu Ser 
135 

gag ttg gac 
Glu Leu Asp 
150 

tat cat gat 
Tyr His Asp 



cca aat ttt 
Pro Asn Phe 
75 

ttc ata gga 
Phe lie Gly 
90 

aca gag aaa 
Thr Glu Lys 
105 

age att ctg 
Ser lie Leu 



gtc aaa cag 
Val Lys Gin 



aaa aat aat 
Lys Asn Asn 
155 

tea ctt tat 
Ser Leu Tyr 
170 



60 

acc 244 
Thr 



ccc 292 
Pro 



agg 34 0 

Arg 



agt 3 88 

Ser 



gee 436 

Ala 

140 

ata 484 
He 



acc 532 
Thr 



acg gac tgc tat aca gee aaa gee agt ctg get gga tct etc tct ctg 580 
Thr Asp Cys Tyr Thr Ala Lys Ala Ser Leu Ala Gly Ser Leu Ser Leu 
175 180 185 

atg ctg att tgc act ctg ctg gaa ttc tgc eta get gtg etc act get 628 
Met Leu He Cys Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala 
190 195 200 

gtg ctg egg tgg aaa cag get tac tct gac ttc cct ggg agt gta ctt 676 
Val Leu Arg Trp Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu 
205 210 215 220 

ttc ctg cct cac agt tac att ggt aat tct ggc atg tec tea aaa atg 724 
Phe Leu Pro His Ser Tyr He Gly Asn Ser Gly Met Ser Ser Lys Met 

225 230 235 

act cat gac tgt gga tat gaa gaa eta ttg act tct taa gaaaaaaggg 773 
Thr His Asp Cys Gly Tyr Glu Glu Leu Leu Thr Ser 

240 245 

agaaatatta atcagaaagt tgattcttat gataatatgg aaaagttaac cattatagaa 833 

aagcaaagct tgagtttcct aaatgtaagc ttttaaagta atgaacatta aaaaaaacca 893 

ttatttcact gtcaaaaaaa aaaaaaaaaa aaaaaaa 93 0 



<210> 10 
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<211> 248 
<212> PRT 
<213> Homo sapiens 



<400> 10 

Met Thr Ser Gin Pro Val Pro Asn Glu Thr lie lie Val Leu Pro Ser 
15 10 15 



Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 

20 25 30 



Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu lie Lys Val lie 
35 40 45 



Gly Thr lie Gin lie Leu Cys Gly Met Met Val Leu Ser Leu Gly lie 
50 55 60 



lie Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gin Val Thr Ser 
65 70 75 80 



Thr Leu Leu Asn Ser Ala Tyr Pro Phe lie Gly Pro Phe Phe Phe lie 

85 90 95 



lie Ser Gly Ser Leu Ser lie Ala Thr Glu Lys Arg Leu Thr Lys Leu 

100 105 110 



Leu Val His Ser Ser Leu Val Gly Ser lie Leu Ser Ala Leu Ser Ala 
115 120 125 



Leu Val Gly Phe lie lie Leu Ser Val Lys Gin Ala Thr Leu Asn Pro 
130 135 140 



Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Asn lie Pro Thr Arg Ser 
145 150 155 160 



Tyr Val Ser Tyr Phe Tyr His Asp Ser Leu Tyr Thr Thr Asp Cys Tyr 

165 170 175 



Thr Ala Lys Ala Ser Leu Ala Gly Ser Leu Ser Leu Met Leu lie Cys 

180 185 190 



Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Arg Trp 
195 200 205 
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Lys Gin Ala Tyr Ser Asp Phe Pro Gly Ser Val Leu Phe Leu Pro His 
210 215 220 



Ser Tyr lie Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys 
225 230 235 240 



Gly Tyr Glu Glu Leu Leu Thr Ser 

245 



<210> 11 

<211> 661 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (15) . . (458) 



<400> 11 

ttggcaacac catt atg aca tea caa cct att tec aat gag acc ate ata 50 

Met Thr Ser Gin Pro lie Ser Asn Glu Thr lie lie 
15 10 

atg etc cca tea aat gtc ate aac ttc tec caa gca gag aaa ccc gaa 98 
Met Leu Pro Ser Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu 
15 20 25 

ccc acc aac cag ggg cag gat age ctg aag aaa cgt eta cag gca aaa 146 
Pro Thr Asn Gin Gly Gin Asp Ser Leu Lys Lys Arg Leu Gin Ala Lys 
30 35 40 



gtc aaa gtt att ggg gtg cat age age ctg get gga age att ctg agt 194 
Val Lys Val lie Gly Val His Ser Ser Leu Ala Gly Ser lie Leu Ser 
45 50 55 60 

get ctg tct gee ctg gtg ggt ttc att etc ctg tct gtc aac ccg get 242 
Ala Leu Ser Ala Leu Val Gly Phe lie Leu Leu Ser Val Asn Pro Ala 

65 70 75 

gca tta aat cct gee tea ttg cag tgt aag ttg gac gaa aag gat ata 290 
Ala Leu Asn Pro Ala Ser Leu Gin Cys Lys Leu Asp Glu Lys Asp lie 

80 85 90 

cca acc aga ctt ctt ctt tct tat gat tat cat tea cct tac acc atg 338 
Pro Thr Arg Leu Leu Leu Ser Tyr Asp Tyr His Ser Pro Tyr Thr Met 
95 100 105 
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gac tgc cat aga gcc aaa gcc agt ctg get gga act ctg tct ctg atg 3 86 

Asp Cys His Arg Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met 
110 115 120 

ctg gtt tct act gtg ttg gag ttc tgc eta get gtg etc act get gtg 434 
Leu Val Ser Thr Val Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val 
125 130 135 140 

ctg cag tgg aaa cag act gtc tga cttccctggt tactcatgtc ctcaaagacg 488 
Leu Gin Trp Lys Gin Thr Val 

145 

actcatgatg ctggatatga agaactattg acttcttggg aaaaaacgga gaaatattaa 548 

ttggaaagtt gatttttatg ataatatgga aaacctaacc attataaaaa agcaaacttg 608 

agtttcctaa atgtaagcat ttaaagtaaa tgcatatttg ttttaaaaaa tta 661 

<210> 12 

<211> 147 

<212> PRT 

<213> Homo sapiens 



<400> 12 

Met Thr Ser Gin Pro lie Ser Asn Glu Thr lie lie Met Leu Pro Ser 
15 10 15 



Asn Val lie Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 

20 25 30 



Gly Gin Asp Ser Leu Lys Lys Arg Leu Gin Ala Lys Val Lys Val lie 
35 40 45 



Gly Val His Ser Ser Leu Ala Gly Ser lie Leu Ser Ala Leu Ser Ala 
50 55 60 



Leu Val Gly Phe lie Leu Leu Ser Val Asn Pro Ala Ala Leu Asn Pro 
65 70 75 80 



Ala Ser Leu Gin Cys Lys Leu Asp Glu Lys Asp lie Pro Thr Arg Leu 

85 90 95 



Leu Leu Ser Tyr Asp Tyr His Ser Pro Tyr Thr Met Asp Cys His Arg 

100 105 110 



Ala Lys Ala Ser Leu Ala Gly Thr Leu Ser Leu Met Leu Val Ser Thr 
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115 120 125 



Val Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Gin Trp Lys 
130 135 140 



Gin Thr Val 
145 



<210> 13 

<211> 1228 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> cDNA 

<222> (1) . . (1228) 



<220> 

<221> CDS 

<222> (118) . . (840) 



<400> 13 

agattgttct ggcaaggaac agccagtggg aggttccagc tgagcgctcc ccagaggtga 60 

gctgatcccc agccacagca cacaggacca ggctgcgaga acagcatcat cagcatc 117 

atg eta tta caa tec caa acc atg ggg gtt tct cac age ttt aca cca 165 

Met Leu Leu Gin Ser Gin Thr Met Gly Val Ser His Ser Phe Thr Pro 
15 10 15 

aa 9 99 c atc act atc cct caa a 9 a 9 a 9 aaa cct 99 a cac at 9 tac caa 213 

Lys Gly He Thr He Pro Gin Arg Glu Lys Pro Gly His Met Tyr Gin 

20 25 30 

aac gaa gat tac ctg cag aac ggg ctg cca aca gaa acc acc gtt ctt 261 
Asn Glu Asp Tyr Leu Gin Asn Gly Leu Pro Thr Glu Thr Thr Val Leu 
35 40 45 

ggg act gtc cag atc ctg tgt tgc ctg ttg att tea agt ctg ggg gee 309 
Gly Thr Val Gin He Leu Cys Cys Leu Leu He Ser Ser Leu Gly Ala 
50 55 60 

atc ttg gtt ttt get ccc tac ccc tec cac ttc aat cca gca att tec 357 
He Leu Val Phe Ala Pro Tyr Pro Ser His Phe Asn Pro Ala He Ser 
65 70 75 80 
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acc act ttg atg tct ggg tac cca ttt tta gga get ctg tgt ttt ggc 405 
Thr Thr Leu Met Ser Gly Tyr Pro Phe Leu Gly Ala Leu Cys Phe Gly 

85 90 95 

att act gga tec etc tea att ate tct gga aaa caa tea act aag ccc 453 
lie Thr Gly Ser Leu Ser lie lie Ser Gly Lys Gin Ser Thr Lys Pro 

100 105 110 

ttt gac ctg age age ttg acc tea aat gca gtg agt tct gtt act gca 501 
Phe Asp Leu Ser Ser Leu Thr Ser Asn Ala Val Ser Ser Val Thr Ala 
115 120 125 

gga gca ggc etc ttc etc ctt get gac age atg gta gee ctg agg act 549 
Gly Ala Gly Leu Phe Leu Leu Ala Asp Ser Met Val Ala Leu Arg Thr 
130 135 140 

gee tct caa cat tgt ggc tea gaa atg gat tat eta tec tea ttg cct 597 
Ala Ser Gin His Cys Gly Ser Glu Met Asp Tyr Leu Ser Ser Leu Pro 
145 150 155 160 

tat teg gag tac tat tat cca ata tat gaa ate aaa gat tgt etc ctg 645 
Tyr Ser Glu Tyr Tyr Tyr Pro lie Tyr Glu lie Lys Asp Cys Leu Leu 

165 170 175 

acc agt gtc agt tta aca ggt gtc eta gtg gtg atg etc ate ttc act 693 
Thr Ser Val Ser Leu Thr Gly Val Leu Val Val Met Leu lie Phe Thr 

180 185 190 

gtg ctg gag etc tta tta get gca tac agt tct gtc ttt tgg tgg aaa 741 
Val Leu Glu Leu Leu Leu Ala Ala Tyr Ser Ser Val Phe Trp Trp Lys 
195 200 205 

cag etc tac tec aac aac cct ggg agt tea ttt tec teg acc cag tea 789 
Gin Leu Tyr Ser Asn Asn Pro Gly Ser Ser Phe Ser Ser Thr Gin Ser 
210 215 220 

caa gat cat ate caa cag gtc aaa aag agt tct tea egg tct tgg ata 837 
Gin Asp His lie Gin Gin Val Lys Lys Ser Ser Ser Arg Ser Trp lie 
225 230 235 240 

taa gtaactcttg gectcagagg aaggaaaagc aactcaacac tcatggtcaa 890 

gtgtgattag actttcctga aatctctgcc attttagata ctgtgaaaca aactaaaaaa 950 

aaaaaagctt ttgttttgta tttgtttact atgagtcgtt atttaatttc tcttgaaaat 1010 

aatttcctca aagcccaagt caataaatgt tatcagecag tcttccaaaa tggtcataaa 1070 

ctttataaac tgctttgggt aaactgagca gaaggtgata cacagaaggg aaaatgtgca 1130 

etcatgetag tgtgaatttg gtaagtcgcg tgactctgea ggctgtttct gtattatttt 1190 

cacactcata ttgcttaaat attacatatt agggattg 1228 



<210> 14 
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<211> 240 
<212> PRT 
<213> Homo sapiens 



<400> 14 

Met Leu Leu Gin Ser Gin Thr Met Gly Val Ser His Ser Phe Thr Pro 
15 10 15 



Lys Gly lie Thr lie Pro Gin Arg Glu Lys Pro Gly His Met Tyr Gin 

20 25 30 



Asn Glu Asp Tyr Leu Gin Asn Gly Leu Pro Thr Glu Thr Thr Val Leu 
35 40 45 



Gly Thr Val Gin lie Leu Cys Cys Leu Leu lie Ser Ser Leu Gly Ala 
50 55 60 



lie Leu Val Phe Ala Pro Tyr Pro Ser His Phe Asn Pro Ala lie Ser 
65 70 75 80 



Thr Thr Leu Met Ser Gly Tyr Pro Phe Leu Gly Ala Leu Cys Phe Gly 

85 90 95 



lie Thr Gly Ser Leu Ser lie lie Ser Gly Lys Gin Ser Thr Lys Pro 

100 105 110 



Phe Asp Leu Ser Ser Leu Thr Ser Asn Ala Val Ser Ser Val Thr Ala 
115 120 125 



Gly Ala Gly Leu Phe Leu Leu Ala Asp Ser Met Val Ala Leu Arg Thr 
130 135 140 



Ala Ser Gin His Cys Gly Ser Glu Met Asp Tyr Leu Ser Ser Leu Pro 
145 150 155 160 



Tyr Ser Glu Tyr Tyr Tyr Pro lie Tyr Glu lie Lys Asp Cys Leu Leu 

165 170 175 



Thr Ser Val Ser Leu Thr Gly Val Leu Val Val Met Leu lie Phe Thr 

180 185 190 



Val Leu Glu Leu Leu Leu Ala Ala Tyr Ser Ser Val Phe Trp Trp Lys 
195 200 205 
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Gin Leu Tyr Ser Asn Asn Pro Gly Ser Ser Phe Ser Ser Thr Gin Ser 
210 215 220 



Gin Asp His lie Gin Gin Val Lys Lys Ser Ser Ser Arg Ser Trp lie 
225 230 235 240 



<210> 15 

<211> 1369 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> cDNA 

<222> (1) . . (1369) 



<220> 

<221> CDS 

<222> (204) . . (956) 



<400> 15 

aaacaggaaa taaatacgaa tgaaactgag ctctaagcag catgtaacct ggcctgcatc 60 

caggaaatag aggacttcgg atccttctaa ccctaccacc caactggccc cagtacattc 12 0 

attctctcag gaaaaaaaac aaggtcccca cagcaaagaa aaggaatagg atcaagagat 180 

acgtggctgc tggcagagca age atg aat teg atg act tea gca gtt ccg gtg 233 

Met Asn Ser Met Thr Ser Ala Val Pro Val 
15 10 

gec aat tct gtg ttg gtg gtg gca ccc cac aat ggt tat cct gtg acc 281 
Ala Asn Ser Val Leu Val Val Ala Pro His Asn Gly Tyr Pro Val Thr 

15 20 25 

cca gga att atg tct cac gtg ccc ctg tat cca aac age cag ccg caa 329 
Pro Gly lie Met Ser His Val Pro Leu Tyr Pro Asn Ser Gin Pro Gin 

30 35 40 

gtc cac eta gtt cct ggg aac cca cct agt ttg gtg teg aat gtg aat 377 
Val His Leu Val Pro Gly Asn Pro Pro Ser Leu Val Ser Asn Val Asn 
45 50 55 

ggg cag cct gtg cag aaa get ctg aaa gaa ggc aaa acc ttg ggg gee 42 5 

Gly Gin Pro Val Gin Lys Ala Leu Lys Glu Gly Lys Thr Leu Gly Ala 
60 65 70 



-20- 



WO 02/062946 



PCT/US01/48437 



ate cag ate ate att ggc ctg get cac ate ggc etc ggc tec ate atg 473 

He Gin He He He Gly Leu Ala His He Gly Leu Gly Ser He Met 

75 80 85 90 

gcg acg gtt etc gta ggg gaa tac ctg tct att tea ttc tac gga ggc 521 

Ala Thr Val Leu Val Gly Glu Tyr Leu Ser He Ser Phe Tyr Gly Gly 

95 100 105 

ttt ccc ttc tgg gga ggc ttg tgg ttt ate att tea gga tct etc tec 569 

Phe Pro Phe Trp Gly Gly Leu Trp Phe He He Ser Gly Ser Leu Ser 

110 115 120 

gtg gca gca gaa aat cag cca tat tct tat tgc ctg ctg tct ggc agt 617 

Val Ala Ala Glu Asn Gin Pro Tyr Ser Tyr Cys Leu Leu Ser Gly Ser 
125 130 135 

ttg ggc ttg aac ate gtc agt gca ate tgc tct gca gtt gga gtc ata 665 

Leu Gly Leu Asn He Val Ser Ala He Cys Ser Ala Val Gly Val He 
140 145 150 



etc ttc ate aca gat eta agt att ccc cac cca tat gee tac ccc gac 713 
Leu Phe He Thr Asp Leu Ser He Pro His Pro Tyr Ala Tyr Pro Asp 
155 160 165 170 

tat tat cct tac gee tgg ggt gtg aac cct gga atg gcg att tct ggc 761 
Tyr Tyr Pro Tyr Ala Trp Gly Val Asn Pro Gly Met Ala He Ser Gly 

175 180 185 

gtg ctg ctg gtc ttc tgc etc ctg gag ttt ggc ate gca tgc gca tct 809 
Val Leu Leu Val Phe Cys Leu Leu Glu Phe Gly He Ala Cys Ala Ser 

190 195 200 

tec cac ttt ggc tgc cag ttg gtc tgc tgt caa tea age aat gtg agt 857 
Ser His Phe Gly Cys Gin Leu Val Cys Cys Gin Ser Ser Asn Val Ser 
205 210 215 

gtc ate tat cca aac ate tat gca gca aac cca gtg ate acc cca gaa 905 
Val He Tyr Pro Asn He Tyr Ala Ala Asn Pro Val He Thr Pro Glu 
220 225 230 

ccg gtg acc tea cca cca agt tat tec agt gag ate caa gca aat aag 953 
Pro Val Thr Ser Pro Pro Ser Tyr Ser Ser Glu He Gin Ala Asn Lys 
235 240 245 250 

taa ggctacagat tctggaagca tctttcactg ggaccaaaag aagtcctcct 1006 

ccctttctgg gcttccataa cccaggtcgt tcctgttctg acagctgagg aaaegtctet 1066 

cccactgttt gtactctcac cttcattctt caattcagtc taggaaacca tgctgtttct 1126 

ctatcaagaa gaagacagag attttaaaca gatgttaacc aagagggact ccctagggca 1186 

catgeatcag cacatatgtg ggcatccagc ctctggggcc ttggcacaca cacattegtg 1246 

tgctctgctg catgtgagct tgtgggttag aggaacaaat atctagacat tcaatcttca 13 06 

ctctttcaat tgtgcattca tttaataaat agatactgag cattcaaaaa aaaaaaaaaa 1366 



aaa 



1369 
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<210> 16 

<211> 250 

<212> PRT 

<213> Homo sapiens 



<400> 16 

Met Asn Ser Met Thr Ser Ala Val Pro Val Ala Asn Ser Val Leu Val 
15 10 15 



Val Ala Pro His Asn Gly Tyr Pro Val Thr Pro Gly lie Met Ser His 

20 25 30 



Val Pro Leu Tyr Pro Asn Ser Gin Pro Gin Val His Leu Val Pro Gly 
35 40 45 



Asn Pro Pro Ser Leu Val Ser Asn Val Asn Gly Gin Pro Val Gin Lys 
50 55 60 



Ala Leu Lys Glu Gly Lys Thr Leu Gly Ala lie Gin lie lie lie Gly 
65 70 75 80 



Leu Ala His lie Gly Leu Gly Ser lie Met Ala Thr Val Leu Val Gly 

85 90 95 



Glu Tyr Leu Ser lie Ser Phe Tyr Gly Gly Phe Pro Phe Trp Gly Gly 

100 105 110 



Leu Trp Phe lie lie Ser Gly Ser Leu Ser Val Ala Ala Glu Asn Gin 
115 120 125 



Pro Tyr Ser Tyr Cys Leu Leu Ser Gly Ser Leu Gly Leu Asn lie Val 
130 135 140 



Ser Ala lie Cys Ser Ala Val Gly Val lie Leu Phe lie Thr Asp Leu 
145 150 155 160 



Ser lie Pro His Pro Tyr Ala Tyr Pro Asp Tyr Tyr Pro Tyr Ala Trp 

165 170 175 



Gly Val Asn Pro Gly Met Ala lie Ser Gly Val Leu Leu Val Phe Cys 

180 185 190 
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Leu Leu Glu Phe Gly lie Ala Cys Ala Ser Ser His Phe Gly Cys Gin 
195 200 205 



Leu Val Cys Cys Gin Ser Ser Asn Val Ser Val lie Tyr Pro Asn lie 
210 215 220 



Tyr Ala Ala Asn Pro Val lie Thr Pro Glu Pro Val Thr Ser Pro Pro 
225 230 235 240 



Ser Tyr Ser Ser Glu lie Gin Ala Asn Lys 

245 250 



<210> 17 

<211> 726 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (1) . . (726) 



<400> 17 

atg aaa gca gaa gcc aca gtt att ccc age cgt tgt get agg ggg etc 48 

Met Lys Ala Glu Ala Thr Val lie Pro Ser Arg Cys Ala Arg Gly Leu 
15 10 15 

cca tea tgg caa gtc etc age cca gtc cag ccc tgg cag aca agt gca 96 

Pro Ser Trp Gin Val Leu Ser Pro Val Gin Pro Trp Gin Thr Ser Ala 

20 25 30 

ccc cag aac acg ace cag ccc aag etc ctg get cca cac cag cac gag 144 

Pro Gin Asn Thr Thr Gin Pro Lys Leu Leu Ala Pro His Gin His Glu 

35 40 45 

aag tec cag aag aag age age ctt ctt aag gag ctg ggg gcc ttc cac 192 

Lys Ser Gin Lys Lys Ser Ser Leu Leu Lys Glu Leu Gly Ala Phe His 
50 55 60 

ate acc ate get ctg ctg cac ctg gtc ttt ggg ggc tac ctg gcc tct 240 

lie Thr lie Ala Leu Leu His Leu Val Phe Gly Gly Tyr Leu Ala Ser 
65 70 75 80 

ata gtc aag aac ctt cac ctg gtg gtg ctg aag tct tgg tat cca ttc 288 

lie Val Lys Asn Leu His Leu Val Val Leu Lys Ser Trp Tyr Pro Phe 

85 90 95 
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tgg ggg get gec tct ttt etc att tea ggg ate ttg gcg ata aca atg 33 6 

Trp Gly Ala Ala Ser Phe Leu lie Ser Gly lie Leu Ala lie Thr Met 

100 105 110 

aag ace ttt tct aaa act tac ctg aag atg ttg tgc ctg atg aca aac 384 
Lys Thr Phe Ser Lys Thr Tyr Leu Lys Met Leu Cys Leu Met Thr Asn 
115 120 125 

etc ate age etc ttt tgc gtg ctg tct ggc etc ttc gtc ate tec aag 432 
Leu lie Ser Leu Phe Cys Val Leu Ser Gly Leu Phe Val lie Ser Lys 
130 135 140 

gat etc ttt ctg gag age cca ttt gag tec ccg ate tgg aga atg tac 480 
Asp Leu Phe Leu Glu Ser Pro Phe Glu Ser Pro lie Trp Arg Met Tyr 
145 150 155 160 

ccc aac tec acg gtc cac ate cag agg ctg gag ctg gee ttg etc tgc 52 8 

Pro Asn Ser Thr Val His lie Gin Arg Leu Glu Leu Ala Leu Leu Cys 

165 170 175 

ttc act gtc eta gag etc ttc ctg cca gtg ccc aca get gtc aca gee 576 
Phe Thr Val Leu Glu Leu Phe Leu Pro Val Pro Thr Ala Val Thr Ala 

180 185 190 

tgg aga ggg gac tgc cca tct gca aag aat gat gat gca tgc ctt gtt 624 
Trp Arg Gly Asp Cys Pro Ser Ala Lys Asn Asp Asp Ala Cys Leu Val 
195 200 205 

ccg aat aca cca ttg cat etc aaa ggc ctg ccg gtg gag ccc ccg cca 672 
Pro Asn Thr Pro Leu His Leu Lys Gly Leu Pro Val Glu Pro Pro Pro 
210 215 220 

tec tac cag agt gtg att caa ggc gac gca caa cac aag caa cat cag 720 
Ser Tyr Gin Ser Val lie Gin Gly Asp Ala Gin His Lys Gin His Gin 
225 230 235 240 



agg tga 
Arg 



<210> 18 

<211> 241 

<212> PRT 

<213> Homo sapiens 



<400> 18 

Met Lys Ala Glu Ala Thr Val lie Pro Ser Arg Cys Ala Arg Gly Leu 
15 10 15 



Pro Ser Trp Gin Val Leu Ser Pro Val Gin Pro Trp Gin Thr Ser Ala 

20 25 30 



Pro Gin Asn Thr Thr Gin Pro Lys Leu Leu Ala Pro His Gin His Glu 



726 
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35 40 45 



Lys Ser Gin Lys Lys Ser Ser Leu Leu Lys Glu Leu Gly Ala Phe His 
50 55 60 



lie Thr lie Ala Leu Leu His Leu Val Phe Gly Gly Tyr Leu Ala Ser 
65 70 75 80 



lie Val Lys Asn Leu His Leu Val Val Leu Lys Ser Trp Tyr Pro Phe 

85 90 95 



Trp Gly Ala Ala Ser Phe Leu lie Ser Gly lie Leu Ala lie Thr Met 

100 105 110 



Lys Thr Phe Ser Lys Thr Tyr Leu Lys Met Leu Cys Leu Met Thr Asn 
115 120 125 



Leu lie Ser Leu Phe Cys Val Leu Ser Gly Leu Phe Val lie Ser Lys 
130 135 140 



Asp Leu Phe Leu Glu Ser Pro Phe Glu Ser Pro lie Trp Arg Met Tyr 
145 150 155 160 



Pro Asn Ser Thr Val His lie Gin Arg Leu Glu Leu Ala Leu Leu Cys 

165 170 175 



Phe Thr Val Leu Glu Leu Phe Leu Pro Val Pro Thr Ala Val Thr Ala 

180 185 190 



Trp Arg Gly Asp Cys Pro Ser Ala Lys Asn Asp Asp Ala Cys Leu Val 
195 200 205 



Pro Asn Thr Pro Leu His Leu Lys Gly Leu Pro Val Glu Pro Pro Pro 
210 215 220 



Ser Tyr Gin Ser Val lie Gin Gly Asp Ala Gin His Lys Gin His Gin 
225 230 235 240 



Arg 

<210> 19 

<211> 518 

<212> DNA 
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<213> Mus musculus 



<220> 

<221> cDNA 
<222> (1) . . (518) 



<220> 

<221> CDS 

<222> (148) . . (516) 



<400> 19 

taagcgctga ggaggaagag actgctggtt ttgggggaca gactctggtg gtcattactg 60 

tctcctcttc tgtaatgagt tggacttgca ggggaaggac ttgtcaaatc ctgaattctt 12 0 

caaggtggta cctcatcagg cagacac atg aag cca gag gag act ggt ggt tct 174 

Met Lys Pro Glu Glu Thr Gly Gly Ser 
1 5 

gtt tat cag ccc ttg gat gag tea cgc cat gtt caa cga ggt gta ctg 222 
Val Tyr Gin Pro Leu Asp Glu Ser Arg His Val Gin Arg Gly Val Leu 
10 15 20 25 

caa gec etc ggg gec ate cag ate ctg aat gga ata ctg att eta get 270 
Gin Ala Leu Gly Ala lie Gin lie Leu Asn Gly lie Leu lie Leu Ala 

30 35 40 

etc gga att ttt ctg gtt tgt tta caa cac gtg tec cac cac ttc agg 318 
Leu Gly lie Phe Leu Val Cys Leu Gin His Val Ser His His Phe Arg 

45 50 55 

cat ttc ttc ttc ttc ace ttc tac aca ggc tac cca ctg tgg ggt get 366 
His Phe Phe Phe Phe Thr Phe Tyr Thr Gly Tyr Pro Leu Trp Gly Ala 
60 65 70 

gtg ttt ttt ate age tea gga tec ttg act gtt gee gca ggg aga aac 414 
Val Phe Phe lie Ser Ser Gly Ser Leu Thr Val Ala Ala Gly Arg Asn 
75 80 85 

ccc aca cga atg ctg atg caa aac agt ttt ggg ata aac att gee agt 462 
Pro Thr Arg Met Leu Met Gin Asn Ser Phe Gly lie Asn lie Ala Ser 
90 95 100 105 



act aca att gca ttt gtt ggg act gtt ttc ctt tct gtg cat ttg gca 510 
Thr Thr lie Ala Phe Val Gly Thr Val Phe Leu Ser Val His Leu Ala 

110 115 120 



ttc aat ac 
Phe Asn 



518 
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<210> 20 

<211> 123 

<212> PRT 

<213> Mus musculus 



<400> 20 

Met Lys Pro Glu Glu Thr Gly Gly Ser Val Tyr Gin Pro Leu Asp Glu 
15 10 15 



Ser Arg His Val Gin Arg Gly Val Leu Gin Ala Leu Gly Ala lie Gin 

20 25 30 



lie Leu Asn Gly lie Leu lie Leu Ala Leu Gly lie Phe Leu Val Cys 
35 40 45 



Leu Gin His Val Ser His His Phe Arg His Phe Phe Phe Phe Thr Phe 
50 55 60 



Tyr Thr Gly Tyr Pro Leu Trp Gly Ala Val Phe Phe lie Ser Ser Gly 
65 70 75 80 



Ser Leu Thr Val Ala Ala Gly Arg Asn Pro Thr Arg Met Leu Met Gin 

85 90 95 



Asn Ser Phe Gly lie Asn lie Ala Ser Thr Thr lie Ala Phe Val Gly 

100 105 110 



Thr Val Phe Leu Ser Val His Leu Ala Phe Asn 
115 120 



<210> 21 

<211> 1228 

<212> DNA 

<213> Mus musculus 



<220> 

<221> CDNA 
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<222> (1) . . (1228) 



<220> 

<221> CDS 

<222> (98) . . (778) 



<400> 21 

ttcggcacga ggcacacaag caaagcttct cctggaataa atcagctgac atctgatcct 60 

ttccattgtc tgtactgttt ctgctgctct ggaaacc atg caa gga cag gaa cag 115 

Met Gin Gly Gin Glu Gin 
1 5 

acc acc atg gca gtg gtt cct gga gtt get gtg cct tea aag aat tct 163 
Thr Thr Met Ala Val Val Pro Gly Val Ala Val Pro Ser Lys Asn Ser 

10 15 20 

gtt atg aca tea caa atg tgg aat gag aag aaa gag aaa ttc ttg aag 211 
Val Met Thr Ser Gin Met Trp Asn Glu Lys Lys Glu Lys Phe Leu Lys 
25 30 35 

999 9 aa ccc aaa 9 tc ctt 999 9 tt: tta caa 9^9 at 9 att 9 ct atc ata 259 
Gly Glu Pro Lys Val Leu Gly Val Leu Gin Val Met lie Ala lie lie 

40 45 50 

aac etc age tta gga ata ata att ttg aca act tta ttt tct gaa eta 307 
Asn Leu Ser Leu Gly lie lie lie Leu Thr Thr Leu Phe Ser Glu Leu 
55 60 65 70 

ccc act tea gtg atg tta atg gtc cca att tgg gga tea ata atg ttc 355 
Pro Thr Ser Val Met Leu Met Val Pro lie Trp Gly Ser lie Met Phe 

75 80 85 

att gtc tec gga tec ctg tec att gca gca gga gtg aca cct aca aaa 403 
lie Val Ser- Gly Ser Leu Ser lie Ala Ala Gly Val Thr Pro Thr Lys 

90 95 100 

tgc ctg atc gtt gee agt eta act ctg aac act atc acc tct gtg ttg 451 
Cys Leu lie Val Ala Ser Leu Thr Leu Asn Thr lie Thr Ser Val Leu 
105 110 115 

get gca act gca age ata atg ggt gta gtc agt gtg get gtg ggt tea 499 
Ala Ala Thr Ala Ser lie Met Gly Val Val Ser Val Ala Val Gly Ser 
120 125 130 

cag ttt ccg ttt egg tat aat tat aca atc acc aag ggt ttg gat gtt 547 
Gin Phe Pro Phe Arg Tyr Asn Tyr Thr lie Thr Lys Gly Leu Asp Val 
135 140 145 150 

ttg atg tta att ttc aat atg eta gaa ttc tgc ctt get gtg tec gtc 595 
Leu Met Leu lie Phe Asn Met Leu Glu Phe Cys Leu Ala Val Ser Val 

155 160 165 

tct gca ttt gga tgt gaa get tec tgt tgt aac tec cgt gag gtt ctt 643 
Ser Ala Phe Gly Cys Glu Ala Ser Cys Cys Asn Ser Arg Glu Val Leu 
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170 175 180 

gta gtg eta cca tea aat cct gtt gag aca gtg atg gca ccc cca atg 691 
Val Val Leu Pro Ser Asn Pro Val Glu Thr Val Met Ala Pro Pro Met 
185 190 195 

aca ctt caa cca ttg eta cca tea gaa cac caa ggg acc aat gtt cca 739 
Thr Leu Gin Pro Leu Leu Pro Ser Glu His Gin Gly Thr Asn Val Pro 
200 205 210 

gga aat gtg tac aag aac cac cca gga gaa ata gtc taa ttttgatgtg 788 
Gly Asn Val Tyr Lys Asn His Pro Gly Glu lie Val 
215 220 225 

tgtgtgtgtg tatttcccta ggatattaac acttcattgc actggctttt gaggtgaata 848 

ttagatttac tgtaagtatg taagtcaagc acttattagg tcaacaacac ttcaacatat 908 

tatattcatt gtatgtacaa ggggcaatga atttgeaaag atgttttgaa agcaaacaga 968 

aaaaaaaaaa aaacaaccaa acaaaagacc tcttagtgaa atgaggtctc tttgeaaaga 102 8 

ctaaaaaact ggagttcaca tttttggggg tggggggggc ttttgctaaa taagtagatt 108 8 

« 

tagatgettt tgatcaagta caaactcata aagtatgtaa gaaattacta atgataggaa 1148 

ccaattttaa tctcatatgt aacatagtgt ataatttaat agatctggta aaaatttata 1208 

ataaagagaa atgcctgaaa 1228 

<210> 22 

<211> 226 

<212> PRT 

<213> Mus musculus 



<400> 22 

Met Gin Gly Gin Glu Gin Thr Thr Met Ala Val Val Pro Gly Val Ala 
15 10 15 



Val Pro Ser Lys Asn Ser Val Met Thr Ser Gin Met Trp Asn Glu Lys 

20 25 30 



Lys Glu Lys Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Val Leu Gin 
35 40 45 



Val Met lie Ala lie lie Asn Leu Ser Leu Gly lie lie lie Leu Thr 
50 55 60 



Thr Leu Phe Ser Glu Leu Pro Thr Ser Val Met Leu Met Val Pro lie 
65 70 75 80 
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Trp Gly Ser lie Met Phe lie Val Ser Gly Ser Leu Ser lie Ala Ala 

85 90 95 



Gly Val Thr Pro Thr Lys Cys Leu lie Val Ala Ser Leu Thr Leu Asn 

100 105 110 



Thr lie Thr Ser Val Leu Ala Ala Thr Ala Ser lie Met Gly Val Val 
115 120 125 



Ser Val Ala Val Gly Ser Gin Phe Pro Phe Arg Tyr Asn Tyr Thr lie 
130 135 140 



Thr Lys Gly Leu Asp Val Leu Met Leu lie Phe Asn Met Leu Glu Phe 
145 150 155 160 



Cys Leu Ala Val Ser Val Ser Ala Phe Gly Cys Glu Ala Ser Cys Cys 

165 170 175 



Asn Ser Arg Glu Val Leu Val Val Leu Pro Ser Asn Pro Val Glu Thr 

180 185 190 



Val Met Ala Pro Pro Met Thr Leu Gin Pro Leu Leu Pro Ser Glu His 
195 200 205 



Gin Gly Thr Asn Val Pro Gly Asn Val Tyr Lys Asn His Pro Gly Glu 
210 215 220 



lie Val 
225 



<210> 23 

<211> 1480 

<212> DNA 

<213> Mus musculus 



<220> 

<221> cDNA 

<222> (1) . . (1480) 
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<220> 

<221> CDS 

<222> (107) . . (787) 



<400> 23 

attcggcacg atgggtgaga acacacaatc aaaacttctc ctggaatata tcatctgaca 60 

tcagatcctt cccattgtct gtactgtttc tgctgccctg gaaacc atg caa gga 115 

Met Gin Gly 
1 

cag gaa cag acc acc atg gca gtg gtt cct gga ggt get cca cct tea 163 
Gin Glu Gin Thr Thr Met Ala Val Val Pro Gly Gly Ala Pro Pro Ser 
5 10 15 

gag aat tct gtt atg aaa tea caa atg tgg aac gag aac aag gag aaa 211 
Glu Asn Ser Val Met Lys Ser Gin Met Trp Asn Glu Asn Lys Glu Lys 
20 25 30 35 

ttc ttg aag ggg gaa ccc aaa gtc ctt ggg gtt gta caa gtt atg att 259 
Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Val Val Gin Val Met lie 

40 45 50 

get etc ata aac etc age ttc gga ata ata att ttg gca aat eta tct 307 
Ala Leu lie Asn Leu Ser Phe Gly He He He Leu Ala Asn Leu Ser 

55 60 65 



tct gaa cca etc att tct gtg gtc tta atg get cca att tgg gga cca 355 
Ser Glu Pro Leu He Ser Val Val Leu Met Ala Pro He Trp Gly Pro 
70 75 80 

ata atg ttc att gtc tea gga tec ctg tea att gca gca gga gtg aga 403 
He Met Phe He Val Ser Gly Ser Leu Ser He Ala Ala Gly Val Arg 
85 90 95 

cct aca aaa aag ctg ate ate age agt eta act ctg aac act ate acc 451 
Pro Thr Lys Lys Leu He He Ser Ser Leu Thr Leu Asn Thr lie Thr 
100 105 110 115 

tct gtg ttg get gca act gca age ata atg ggt gta gtc agt gtg get 499 
Ser Val Leu Ala Ala Thr Ala Ser He Met Gly Val Val Ser Val Ala 

120 125 130 

9tg ggt tea cag ttt ccg ttt egg tat aat tat aca ate acc aag ggt 547 
Val Gly Ser Gin Phe Pro Phe Arg Tyr Asn Tyr Thr He Thr Lys Gly 

135 140 145 

ttg gat att ttg atg tta att tta aat atg eta gaa ttc tgc att get 595 
Leu Asp He Leu Met Leu He Leu Asn Met Leu Glu Phe Cys He Ala 
150 155 160 

gtg tec ate tct get ttt gga tgt aaa get tec tgt tgt aac tec age 643 
Val Ser He Ser Ala Phe Gly Cys Lys Ala Ser Cys Cys Asn Ser Ser 
165 170 175 

gag gtt ctt gta gtg eta cca tea aat cct get gtg act gtg atg gca 691 
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Glu Val Leu Val Val Leu Pro Ser Asn Pro Ala Val Thr Val Met Ala 
180 185 190 195 

ccc cct gtg acg ctt caa cca ttg cca cca tea gaa cac caa ggg aaa 739 
Pro Pro Val Thr Leu Gin Pro Leu Pro Pro Ser Glu His Gin Gly Lys 

200 205 210 

aat gtt cca gaa aat gta tat aag aac cac tea gaa gaa ata gtc taa 787 
Asn Val Pro Glu Asn Val Tyr Lys Asn His Ser Glu Glu lie Val 

220 225 

-gtgtgtgtg tgtgtgtgta tgcatttccc 847 

-tggggggaa tatcagacat gecataaett 907 

actttaacat cctccattaa ttgtatgtac 967 

aaagcaaaat gaaacaaaca aacaagecag 1027 

:aaagattaa agaaaggegt tcacatttct 1087 

rtgcttttga tcatgtacaa acacatgaat 1147 

:aatttttat ctctgtgcat attcatcata 1207 

jgtaaaaatt ataataaaga gaagtgactg 1267 

icacatttac atacatagtg tatatctctg 1327 

:agagattat agaaggtata catacacata 1387 

:acttataac tgtggtttgg tgctgtaatc 1447 

?tc 1480 





215 




ttctcttgtg 


tgtatgtgaa 


tgtgtgtgtg 


tgggaaatta 


acacttcatt 


gccctggctt 


tgtaaatcta 


acatctacta 


gtacaacaac 


aaggggcaat 


gaattagcaa 


agatgttttg 


cacacaaaaa 


aagacctgtt 


ttgtcttttg 


ttggggcttt 


tgctaaataa 


gtggatttag 


tatgtatatt 


atattactaa 


tgataggaac 


taataacata 


ctctatactg 


taattgatat 


aaa ueggaac 


acacaaacac 


aCaCaCaCaC 


tatatacata 


gtacgtgtat 


ggtgtgcatg 


tgtgtacata 


ttacatatga 


cctaaaaaat 


tcccctaagg 


ctcacatctt 


tgaacagttg 


<210> 24 






<211> 226 






<212> PRT 






<213> Mus 


musculus 





<400> 24 

Met Gin Gly Gin Glu Gin Thr Thr Met Ala Val Val Pro Gly Gly Ala 
15 10 15 



Pro Pro Ser Glu Asn Ser Val Met Lys Ser Gin Met Trp Asn Glu Asn 

20 25 30 



Lys Glu Lys Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Val Val Gin 
35 40 45 



Val Met lie Ala Leu lie Asn Leu Ser Phe Gly lie lie lie Leu Ala 
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50 55 60 



Asn Leu Ser Ser Glu Pro Leu lie Ser Val Val Leu Met Ala Pro lie 
65 70 75 80 



Trp Gly Pro lie Met Phe lie Val Ser Gly Ser Leu Ser lie Ala Ala 

85 90 95 



Gly Val Arg Pro Thr Lys Lys Leu lie lie Ser Ser Leu Thr Leu Asn 

100 105 110 



Thr lie Thr Ser Val Leu Ala Ala Thr Ala Ser lie Met Gly Val Val 
115 120 125 



Ser Val Ala Val Gly Ser Gin Phe Pro Phe Arg Tyr Asn Tyr Thr lie 
130 135 140 



Thr Lys Gly Leu Asp lie Leu Met Leu lie Leu Asn Met Leu Glu Phe 
145 150 155 160 



Cys lie Ala Val Ser lie Ser Ala Phe Gly Cys Lys Ala Ser Cys Cys 

165 170 175 



Asn Ser Ser Glu Val Leu Val Val Leu Pro Ser Asn Pro Ala Val Thr 

180 185 190 



Val Met Ala Pro Pro Val Thr Leu Gin Pro Leu Pro Pro Ser Glu His 
195 200 205 



Gin Gly Lys Asn Val Pro Glu Asn Val Tyr Lys Asn His Ser Glu Glu 
210 215 220 



lie Val 
225 



<210> 25 

<211> 1187 

<212> DNA 

<213> Mus musculus 



<220> 

<221> cDNA 
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<222> (1) . . (1187) 



<220> 

<221> CDS 

<222> (36) . . (713) 



<400> 25 

ccattgtctg tactgtttct gctgccctgg aaacc atg caa gga ctg gca cag 53 

Met Gin Gly Leu Ala Gin 
1 5 

acc acc atg gca gtg gtt cct gga ggt get cca cct tea gag aat tct 101 
Thr Thr Met Ala Val Val Pro Gly Gly Ala Pro Pro Ser Glu Asn Ser 

10 15 20 

gtt ata aaa tea caa atg tgg aac aag aac aaa gag aaa ttc ttg aag 149 
Val lie Lys Ser Gin Met Trp Asn Lys Asn Lys Glu Lys Phe Leu Lys 
25 30 35 

ggg gaa ccc aaa gtc etc ggg get ata caa gtt atg att get ttc ata 197 
Gly Glu Pro Lys Val Leu Gly Ala lie Gin Val Met lie Ala Phe lie 
40 45 50 

aac ttc age tta gga ata ata att ata tta aat aga gtt tct gaa cga 245 
Asn Phe Ser Leu Gly lie lie lie lie Leu Asn Arg Val Ser Glu Arg 
55 60 65 70 

ttc atg tea gtg etc tta ctg gee cca ttt tgg gga tea ata atg ttc 293 
Phe Met Ser Val Leu Leu Leu Ala Pro Phe Trp Gly Ser lie Met Phe 

75 80 85 

att ttc tea gga tec ctg tea att gca gca gga gtg aaa cct aca aaa 341 
lie Phe Ser Gly Ser Leu Ser lie Ala Ala Gly Val Lys Pro Thr Lys 

90 95 100 

gec atg ate ate age agt eta agt gtg aac act ate agt tct gtg ttg 3 89 

Ala Met lie lie Ser Ser Leu Ser Val Asn Thr He Ser Ser Val Leu 
105 110 115 

get gtg gca gca age att att ggc gtc ate agt gtg att tct ggt gtt 437 
Ala Val Ala Ala Ser He He Gly Val He Ser Val He Ser Gly Val 
120 125 130 



ttt cgc caa ttt aga agt caa cca gee ate get agt ttg gat gtt ttg 485 
Phe Arg Gin Phe Arg Ser Gin Pro Ala He Ala Ser Leu Asp Val Leu 
135 140 145 150 

atg aca att ttg aat atg eta gaa ttc tgc att get gtg tec gtc tct 533 
Met Thr He Leu Asn Met Leu Glu Phe Cys He Ala Val Ser Val Ser 

155 160 165 

gca ttt ggg tgt aaa get tec tgt tgt aac tec agt gag gtt ctt gta 581 
Ala Phe Gly Cys Lys Ala Ser Cys Cys Asn Ser Ser Glu Val Leu Val 

170 175 180 
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gtg eta cca tea aat tct get gtg aca gtg aca gca ccc ccc atg ata 629 
Val Leu Pro Ser Asn Ser Ala Val Thr Val Thr Ala Pro Pro Met lie 
185 190 195 

ctt caa cca ttg cca cca tea gaa tgc caa ggg aaa aat gtt cca gaa 677 
Leu Gin Pro Leu Pro Pro Ser Glu Cys Gin Gly Lys Asn Val Pro Glu 
200 205 210 

aat eta tac agg aac caa cca gga gaa ata gtc taa ttttgatgta 723 
Asn Leu Tyr Arg Asn Gin Pro Gly Glu lie Val 

783 
843 
903 
963 
1023 
1083 
1143 
1187 



215 


220 


225 






cgtgttttca 


tgtgtgagag 


tgtgtgtgtg 


cacatgtgta 


tgcatttggc 


tttttgtacg 


aatataagac 


atgctctaaa 


taagtaagtc 


aagcatttat 


taagtcaata 


acactttaaa 


atcttctatt 


cattgtatgt 


acaagaagaa 


tttaattggc 


aaaaaaaatt 


ttaatgtaaa 


ctgataaaaa 


ccttggttta 


gtgaggtgaa 


gtctccaaag 


agtgaagaat 


ggagttcatg 


ggtttttttt 


gggggggggt 


tgctaaataa 


ataaatttgg 


ctgctcacga 


tcatatacag 


acacataaac 


tatgtaagac 


agcaggtgta 


ttaccttact 


aataatagca 


accaatttta 


atcctacatg 


aatgttcatc 


ttaatagtat 


atgccttaat 


tgttatggtg 


aaaagttaca 


ataaacaaaa 


gtgattgaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaag 





<210> 26 

<211> 225 

<212> PRT 

<213> Mus musculus 



<400> 26 

Met Gin Gly Leu Ala Gin Thr Thr Met Ala Val Val Pro Gly Gly Ala 
15 10 15 



Pro Pro Ser Glu Asn Ser Val lie Lys Ser Gin Met Trp Asn Lys Asn 

20 25 30 



Lys Glu Lys Phe Leu Lys Gly Glu Pro Lys Val Leu Gly Ala lie Gin 
35 40 45 



Val Met lie Ala Phe lie Asn Phe Ser Leu Gly lie lie lie lie Leu 
50 55 60 



Asn Arg Val Ser Glu Arg Phe Met Ser Val Leu Leu Leu Ala Pro Phe 
65 70 75 80 
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Trp Gly Ser lie Met Phe lie Phe Ser Gly Ser Leu Ser lie Ala Ala 

85 90 95 



Gly Val Lys Pro Thr Lys Ala Met lie lie Ser Ser Leu Ser Val Asn 

100 105 110 



Thr lie Ser Ser Val Leu Ala Val Ala Ala Ser lie lie Gly Val lie 
115 120 125 



Ser Val lie Ser Gly Val Phe Arg Gin Phe Arg Ser Gin Pro Ala lie 
130 135 140 



Ala Ser Leu Asp Val Leu Met Thr lie Leu Asn Met Leu Glu Phe Cys 
145 150 155 160 



lie Ala Val Ser Val Ser Ala Phe Gly Cys Lys Ala Ser Cys Cys Asn 

165 170 175 



Ser Ser Glu Val Leu Val Val Leu Pro Ser Asn Ser Ala Val Thr Val 

180 185 190 



Thr Ala Pro Pro Met lie Leu Gin Pro Leu Pro Pro Ser Glu Cys Gin 
195 200 205 



Gly Lys Asn Val Pro Glu Asn Leu Tyr Arg Asn Gin Pro Gly Glu lie 
210 215 220 



Val 
225 



<210> 27 

<211> 1687 

<212> DNA 

<213> Mus musculus 



<220> 

<221> cDNA 

<222> (1) . . (1687) 
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45 



<220> 

<221> CDS 

<222> (133) . . (867) 



<400> 27 

10 gttacaagtt cacaagaagg aaccaaggat cagcctgaga gaacccagag ttaaggctct 60 

tcgggttcct gagagctcgg ctggaagtga ctgggtgaca aggcacacag gctcagccgt 12 0 

gggagctcaa tc atg att cca caa gta gtg acc agt gag act gtg gca atg 171 
15 Met lie Pro Gin Val Val Thr Ser Glu Thr Val Ala Met 

15 10 

att teg cca aat gga atg agt ctt ccc caa aca gac aaa ccc cag cct 219 
lie Ser Pro Asn Gly Met Ser Leu Pro Gin Thr Asp Lys Pro Gin Pro 
20 15 20 25 

ttc cac cag tgg caa gac age ctg aag aaa cat eta aag get gag ate 267 

Phe His Gin Trp Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu lie 

30 35 40 45 

25 

aaa gtg atg gcg gca ate cag ate atg tgt get gtg atg gtg ttg agt 315 

Lys Val Met Ala Ala lie Gin lie Met Cys Ala Val Met Val Leu Ser 

50 55 60 

,30 ctg gga ate att ttg gca tct gtt ccc tec aat eta cac ttt acc tea 363 

Leu Gly lie lie Leu Ala Ser Val Pro Ser Asn Leu His Phe Thr Ser 

65 70 75 

gtg ttt tea gtc ctg tta aaa tct ggc tac cca ttt ata gga get ttg 411 
35 Val Phe Ser Val Leu Leu Lys Ser Gly Tyr Pro Phe lie Gly Ala Leu 

80 85 90 

ttt ttt ata gtc tct gga att ctg tec ate gtc acg gag aca aag tea 459 
Phe Phe lie Val Ser Gly lie Leu Ser lie Val Thr Glu Thr Lys Ser 
40 95 100 105 

aca aaa att ttg gta gac age age ctg act ctg aat ate ctg agt gtt 507 
Thr Lys lie Leu Val Asp Ser Ser Leu Thr Leu Asn lie Leu Ser Val 
110 115 120 125 



tea ttt get ttc atg ggc ate att ate ate tct gtc age ctg get ggt 555 
Ser Phe Ala Phe Met Gly lie lie lie lie Ser Val Ser Leu Ala Gly 

130 135 140 



50 ttg cat cct gec tea gag cag tgc ttg cag age aag gag ctt aga cca 603 

Leu His Pro Ala Ser Glu Gin Cys Leu Gin Ser Lys Glu Leu Arg Pro 

145 150 155 

act gaa tat cat tac tac caa ttc ttg gac agg aac gag tgc ttt gee 651 
55 Thr Glu Tyr His Tyr Tyr Gin Phe Leu Asp Arg Asn Glu Cys Phe Ala 

160 165 170 

gec aag tct gtt ctg get gga gtc ttt tea ctg atg ctg ate agt act 699 
Ala Lys Ser Val Leu Ala Gly Val Phe Ser Leu Met Leu lie Ser Thr 
60 175 180 185 
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atg ttg gaa ctt ggc ctg get gtc etc act gee atg ctg tgg tgg aaa 747 
Met Leu Glu Leu Gly Leu Ala Val Leu Thr Ala Met Leu Trp Trp Lys 
190 195 200 205 

5 cag agt cac tct aac ate cct ggg aat gtt atg ttc ctg cca cat age 795 

Gin Ser His Ser Asn lie Pro Gly Asn Val Met Phe .Leu Pro His Ser 

210 215 220 

tea aat aat gac tec aac atg gaa tea aag gta ctt tgt aac ccc tea 843 
10 Ser Asn Asn Asp Ser Asn Met Glu Ser Lys Val Leu Cys Asn Pro Ser 

225 230 235 

tat gag gaa caa ttg gtt tgt taa gaaaaacaaa acaaaacaaa actaaatacc 897 
Tyr Glu Glu Gin Leu Val Cys 
15 240 

acaaaaacaa atggaactat acegcagaag atatgtcttc atgataatgc agaaattcca 957 

aceatcacag ggtagcaatg ettgetaett aaaatgtaga ctgttcatac agtgggtacc 1017 

agtatgagtt gaatgtgtgt attactggca ccctattgat tttcatgacc ttggcttcag 1077 

ccaaagccca gacctacaaa tggtggcctt tcttagaaaa ccaaacagaa tgtttcaggc 1137 

25 cattgtagtg gggaaaaggg acaacatttt cttgccccct gcaattaaca gcaacagtta 1197 

accattagca gtctttgtat tcagaatctc tgctatgacg tgggtcttct atcatatcag 1257 

ttttattcac tggtgtgaat aaacaaggga ectgeaatga agtctgaaga agtttccatg 1317 

ttgttggttc aaaataaagt ctttttgtga ttgtatattc tttttgtatg gggttttgtg 1377 

tgggttttgt tgttgttgtt gttttgcttt tatgeacaga tcaagtccta agtggtacat 1437 

35 aaacacatgt gtttgetatt ttttttttgt tctctatcaa ccaaaaaaaa aaaaaaaaaa 1497 

ggcatgggaa agagaaaaag gaatatatct gtgtctctgt gtttatgtgg tgtgtgtgtg 1557 

tgtgtgtttt aaactaaaac tgaatgaaca ttcaaagttt agcaatgtat tttggaggta 1617 

ccagataaca tacttatttc tagaatcgee agagcaaatg agagacattt tgtegcttet 1677 

tataagaggc 1687 

45 

<210> 28 
<211> 244 
50 <212> PRT 

<213> Mus musculus 



30 



40 



55 



60 



<400> 28 

Met lie Pro Gin Val Val Thr Ser Glu Thr Val Ala Met lie Ser Pro 
15 10 15 
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Asn Gly Met Ser Leu Pro Gin Thr Asp Lys Pro Gin Pro Phe His Gin 

20 25 30 



Trp Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu lie Lys Val Met 
35 40 45 



Ala Ala lie Gin lie Met Cys Ala Val Met Val Leu Ser Leu Gly lie 
50 55 60 



lie Leu Ala Ser Val Pro Ser Asn Leu His Phe Thr Ser Val Phe Ser 
65 70 75 80 



Val Leu Leu Lys Ser Gly Tyr Pro Phe lie Gly Ala Leu Phe Phe lie 

85 90 95 



Val Ser Gly lie Leu Ser lie Val Thr Glu Thr Lys Ser Thr Lys lie 

100 105 110 



Leu Val Asp Ser Ser Leu Thr Leu Asn lie Leu Ser Val Ser Phe Ala 
115 120 125 



Phe Met Gly lie lie lie lie Ser Val Ser Leu Ala Gly Leu His Pro 
130 135 140 



Ala Ser Glu Gin Cys Leu Gin Ser Lys Glu Leu Arg Pro Thr Glu Tyr 
145 150 155 160 



His Tyr Tyr Gin Phe Leu Asp Arg Asn Glu Cys Phe Ala Ala Lys Ser 

165 170 175 



Val Leu Ala Gly Val Phe Ser Leu Met Leu lie Ser Thr Met Leu Glu 

180 185 190 



Leu Gly Leu Ala Val Leu Thr Ala Met Leu Trp Trp Lys Gin Ser His 
195 200 205 



Ser Asn lie Pro Gly Asn Val Met Phe Leu Pro His Ser Ser Asn Asn 
210 215 220 



Asp Ser Asn Met Glu Ser Lys Val Leu Cys Asn Pro Ser Tyr Glu Glu 
225 230 235 240 



Gin Leu Val Cys 
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<210> 29 

<211> 899 

<212> DNA 

<213> Mus musculus 



<220> 

<221> cDNA 
<222> (1) . . (899) 



<220> 

<221> CDS 

<222> (164) . . (817) 



<400> 29 

aattcggcac taggaggaag ggctggagac agttacaagt tcacaagaag gaaccaagga 60 

tcagcctgag agaacccaga gttaaggctc ttagggtttc tgagagctcg gctggaagtg 120 

actgggtgac aaggcacaca ggctcagctg tggcagctcc ate atg att cca cag 175 

Met lie Pro Gin 
1 

gta gtg acc aat gag acc ate aca acg att tea cca aat gga ate aac 223 
Val Val Thr Asn Glu Thr lie Thr Thr lie Ser Pro Asn Gly lie Asn 
5 10 15 20 

ttt ccc caa aaa gac gag tec cag cct acc caa cag agg caa gac age 271 
Phe Pro Gin Lys Asp Glu Ser Gin Pro Thr Gin Gin Arg Gin Asp Ser 

25 30 35 

ctg aag aaa cat eta aag get gag ate aaa gtg ata gtg gca ate cag 319 
Leu Lys Lys His Leu Lys Ala Glu lie Lys Val lie Val Ala lie Gin 

40 45 50 

ate atg tgt get gtg aca gtg ttg get ctg gga ate att ttg gca tct 367 
lie Met Cys Ala Val Thr Val Leu Ala Leu Gly lie lie Leu Ala Ser 
55 60 65 

gtt cct cct gtc cca tat ttt aac tea gtg ttt tct gtc ctg tta aaa 415 
Val Pro Pro Val Pro Tyr Phe Asn Ser Val Phe Ser Val Leu Leu Lys 
70 75 80 

tct ggc tac cca ttt ata gga get ttg ttt ttt att gee tct gga att 463 
Ser Gly Tyr Pro Phe lie Gly Ala Leu Phe Phe lie Ala Ser Gly lie 
85 90 95 100 

ttg tec ate att acg gag aga aag tea aca aaa cct ttg gta gat gee 511 
Leu Ser lie lie Thr Glu Arg Lys Ser Thr Lys Pro Leu Val Asp Ala 
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105 110 115 

age eta act ctg aat ate ctg agt gtt tea ttt get ttc gtg ggc ate 559 
Ser Leu Thr Leu Asn lie Leu Ser Val Ser Phe Ala Phe Val Gly lie 

120 125 130 

att ate ate tct gtc age ctg get ggt ttg cat cct gee tea gag caa 607 
lie lie lie Ser Val Ser Leu Ala Gly Leu His Pro Ala Ser Glu Gin 
135 140 145 

tgc aag cag age aag gag ctt agt eta att gaa cat gac tac tac caa 655 
Cys Lys Gin Ser Lys Glu Leu Ser Leu lie Glu His Asp Tyr Tyr Gin 
150 155 160 

cct ttc tac aac tea gac agg agt gaa tgt gee gtc ace aag tct att 703 
Pro Phe Tyr Asn Ser Asp Arg Ser Glu Cys Ala Val Thr Lys Ser lie 
165 170 175 180 

ctg act gga gee ctt tea gtg atg ctg ate ate agt gtg ttg gag ctt 751 
Leu Thr Gly Ala Leu Ser Val Met Leu lie lie Ser Val Leu , Glu Leu 

185 190 195 

ggt ctg get ttg etc tct gee atg ctg tgg ttg aga gag ggt gtt ctg 799 
Gly Leu Ala Leu Leu Ser Ala Met Leu Trp Leu Arg Glu Gly Val Leu 

200 205 210 



act tec ctt aga atg tga ttttttctgt cttataactc caataatgaa 847 
Thr Ser Leu Arg Met 
215 

gccaatgtgg aatcaaaggc actttgtaac tgtgcatatg aggaacgatt gt 899 



<210> 30 

<211> 217 

<212> PRT 

<2 13 > Mus musculus 



<400> 30 

Met lie Pro Gin Val Val Thr Asn Glu Thr lie Thr Thr lie Ser Pro 
15 10 15 



Asn Gly lie Asn Phe Pro Gin Lys Asp Glu Ser Gin Pro Thr Gin Gin 

20 25 30 



Arg Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu lie Lys Val lie 
35 40 45 



Val Ala lie Gin lie Met Cys Ala Val Thr Val Leu Ala Leu Gly lie 
50 55 60 
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lie Leu Ala Ser Val Pro Pro Val Pro Tyr Phe Asn Ser Val Phe Ser 
65 70 75 80 



Val Leu Leu Lys Ser Gly Tyr Pro Phe lie Gly Ala Leu Phe Phe lie 

85 90 95 



Ala Ser Gly lie Leu Ser lie lie Thr Glu Arg Lys Ser Thr Lys Pro 

100 105 110 



Leu Val Asp Ala Ser Leu Thr Leu Asn lie Leu Ser Val Ser Phe Ala 
115 120 125 



Phe Val Gly lie lie lie lie Ser Val Ser Leu Ala Gly Leu His Pro 
130 135 140 



Ala Ser Glu Gin Cys Lys Gin Ser Lys Glu Leu Ser Leu lie Glu His 
145 150 155 160 



Asp Tyr Tyr Gin Pro Phe Tyr Asn Ser Asp Arg Ser Glu Cys Ala Val 

165 170 175 



Thr Lys Ser lie Leu Thr Gly Ala Leu Ser Val Met Leu lie lie Ser 

180 185 190 



Val Leu Glu Leu Gly Leu Ala Leu Leu Ser Ala Met Leu Trp Leu Arg 
195 200 205 



Glu Gly Val Leu Thr Ser Leu Arg Met 
210 215 



<210> 31 

<211> 1370 

<212> DNA 

<213> Mus musculus 



<220> 

<221> cDNA 

<222> (1) . . (1370) 



<220> 
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<221> CDS 

<222> (110) . . (853) 



<400> 31 

caaggatcag actgagaaaa cccatagtta aagctcttgg ggtttctgag agctcagctg 60 

10 gaagtgactg ggtgacaagg cgcacaggct cagccgtgga agctccatc atg att cca 118 

Met lie Pro 

1 

caa gta gtg acc agt gag act gtc aca gtg att tea cca aat gga ate 166 
15 Gin Val Val Thr Ser Glu Thr Val Thr Val lie Ser Pro Asn Gly lie 

5 10 15 

age ttt ccc caa aca gac aaa ccc cag cct tec cac cag age caa gac 214 
Ser Phe Pro Gin Thr Asp Lys Pro Gin Pro Ser His Gin Ser Gin Asp 
20 20 25 30 35 

age ctg aag aaa cat eta aag get gag ate aaa gtg atg gcg gca ate 262 

Ser Leu Lys Lys His Leu Lys Ala Glu lie Lys Val Met Ala Ala lie 

40 45 50 

25 

cag ate atg tgt get gtg atg gtg ttg agt ctg gga ate att ttg gca 310 

Gin lie Met Cys Ala Val Met Val Leu Ser Leu Gly lie lie Leu Ala 

55 60 65 

30 tct gtt ccc tec aat eta cac ttt acc tea gtg ttt tec ate ctg tta 358 

Ser Val Pro Ser Asn Leu His Phe Thr Ser Val Phe Ser lie Leu Leu 
70 75 80 

gaa tct ggc tac cca ttt gta gga get ttg ttt ttt gee ate tct gga 406 
35 Glu Ser Gly Tyr Pro Phe Val Gly Ala Leu Phe Phe Ala lie Ser Gly 

85 90 95 

att ctg tct att gtc aca gag aaa aag atg act aag cct ttg gtt cac 454 
lie Leu Ser lie Val Thr Glu Lys Lys Met Thr Lys Pro Leu Val His 
40 100 105 110 115 

age age eta gee ctg age ate ctg agt gtc etc tct get ctt aca ggc 502 
Ser Ser Leu Ala Leu Ser lie Leu Ser Val Leu Ser Ala Leu Thr Gly 

120 125 130 



45 



ate get att etc tct gtc agt ttg get get tta gag cct gec ttg cag 550 
lie Ala lie Leu Ser Val Ser Leu Ala Ala Leu Glu Pro Ala Leu Gin 

135 140 145 



50 caa tgt aag ctg get ttc aca caa eta gac aca acc caa gac get tat 598 

Gin Cys Lys Leu Ala Phe Thr Gin Leu Asp Thr Thr Gin Asp Ala Tyr 
150 155 160 

cat ttc ttt age cct gag cca tta aac age tgc ttc gtg gec aaa get 646 

55 His Phe Phe Ser Pro Glu Pro Leu Asn Ser Cys Phe Val Ala Lys Ala 

165 170 175 

get ctg act gga gtc ttt tea ctg atg eta ate age agt gtg ttg gag 694 

Ala Leu Thr Gly Val Phe Ser Leu Met Leu lie Ser Ser Val Leu Glu 
60 180 185 190 195 
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ctt ggc ctg get gtc etc act gec aca ctg tgg tgg aaa cag age tec 742 
Leu Gly Leu Ala Val Leu Thr Ala Thr Leu Trp Trp Lys Gin Ser Ser 

200 205 210 

tct get ttc tct ggg aat gtg att ttc ctg tct cag aac tea aag aat 790 
Ser Ala Phe Ser Gly Asn Val lie Phe Leu Ser Gin Asn Ser Lys Asn 

215 220 225 

aaa tec agt gta tct tea gag tea ctt tgt aac cct aca tat gaa aac 838 
Lys Ser Ser Val Ser Ser Glu Ser Leu Cys Asn Pro Thr Tyr Glu Asn 
230 235 240 

ata ttg act tea taa gaattaagta gaggttatat agcagaaaaa tctgtcttta 893 
lie Leu Thr Ser 
245 

acatgattta gaaaagecat ttactgtgtg acaacaatgc ttaatatctt gatatcttaa 953 

tgtgtgtatt ggttaatcag caaccatgaa aaacatacta actggctggg ttcagtagca 1013 

cgctcttgat ttggtgtcag tcaaaacaca gacctgtaaa ttccaattta tgtagtggtc 1073 

aaagagcccc aattattttc tcaaaaaact ggaagaatgt ttcataggat catggtggag 1133 

ccaatgggca acagttcttc ttatccttgt cacttggctg caggaggtac tgactagggc 1193 

ctgagatcat attctgtgtg cgtggcatgg acttcatggc atctatttta ttcataagca 1253 

catgaaaaca agtcatctct tatgaagtct caaagagcat aaaaaagtta gcctccaaat 1313 

aaagtcttta tataatcccc ccaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 1370 

<210> 32 

<211> 247 

<212> PRT 

<213> Mus musculus 



<400> 32 

Met lie Pro Gin Val Val Thr Ser Glu Thr Val Thr Val lie Ser Pro 
15 10 15 



Asn Gly lie Ser Phe Pro Gin Thr Asp Lys Pro Gin Pro Ser His Gin 

20 25 30 



Ser Gin Asp Ser Leu Lys Lys His Leu Lys Ala Glu lie Lys Val Met 
35 40 45 



Ala Ala lie Gin lie Met Cys Ala Val Met Val Leu Ser Leu Gly lie 
50 55 60 
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lie Leu Ala Ser Val Pro Ser Asn Leu His Phe Thr Ser Val Phe Ser 
65 70 75 80 



lie Leu Leu Glu Ser Gly Tyr Pro Phe Val Gly Ala Leu Phe Phe Ala 

85 90 95 



lie Ser Gly lie Leu Ser lie Val Thr Glu Lys Lys Met Thr Lys Pro 

100 105 110 



Leu Val His Ser Ser Leu Ala Leu Ser lie Leu Ser Val Leu Ser Ala 
115 120 125 



Leu Thr Gly lie Ala lie Leu Ser Val Ser Leu Ala Ala Leu Glu Pro 
130 135 140 



Ala Leu Gin Gin Cys Lys Leu Ala Phe Thr Gin Leu Asp Thr Thr Gin 
145 150 155 160 



Asp Ala Tyr His Phe Phe Ser Pro Glu Pro Leu Asn Ser Cys Phe Val 

165 170 175 



Ala Lys Ala Ala Leu Thr Gly Val Phe Ser Leu Met Leu lie Ser Ser 

180 185 190 



Val Leu Glu Leu Gly Leu Ala Val Leu Thr Ala Thr Leu Trp Trp Lys 
195 200 205 



Gin Ser Ser Ser Ala Phe Ser Gly Asn Val lie Phe Leu Ser Gin Asn 
210 215 220 



Ser Lys Asn Lys Ser Ser Val Ser Ser Glu Ser Leu Cys Asn Pro Thr 
225 230 235 240 



Tyr Glu Asn lie Leu Thr Ser 

245 



<210> 33 

<211> 938 

<212> DNA 

<213> Mus musculus 
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<220> 

<221> CDNA 
<222> (1) . . (938) 



<220> 

<221> CDS 

<222> (105) . . (809) 



<400> 33 

aacagctggt aagaggttga gtctgagttc cccaggagca gagtgagcag gcccaggagc 60 

tgcagcacac aggacagggc tgtgaggaca gcatccctgg cacc atg cgt eta cag 116 

Met Arg Leu Gin 
1 

ctt ggc acc aag aac att ggg tgg gac tgc ttt cca aag gac ate att 164 
Leu Gly Thr Lys Asn lie Gly Trp Asp Cys Phe Pro Lys Asp lie lie 
5 10 15 20 

ate cac aaa aga gag aaa act gga cat aca tat gaa aaa gaa gat gac 212 
lie His Lys Arg Glu Lys Thr Gly His Thr Tyr Glu Lys Glu Asp Asp 

25 30 35 

ctg ctg att gga gtg cct agt gaa gee aca ctt ctt gga acc ate cag 260 
Leu Leu lie Gly Val Pro Ser Glu Ala Thr Leu Leu Gly Thr lie Gin 

40 45 50 

eta eta tgt gee ctg ata ctt gca age ttt ggg ggc att ttg gtg tea 308 
Leu Leu Cys Ala Leu lie Leu Ala Ser Phe Gly Gly lie Leu Val Ser 
55 60 65 

get tec tac ttc aac cca gaa gtt tec acc act ttg ata tct gga tac 356 
Ala Ser Tyr Phe Asn Pro Glu Val Ser Thr Thr Leu lie Ser Gly Tyr 
70 75 80 

ctg ttt att gga tec etc tgt ttt gee att get gga att etc tec att 404 
Leu Phe lie Gly Ser Leu Cys Phe Ala lie Ala Gly lie Leu Ser lie 
85 90 95 100 

ate tct gaa aaa ate tea acc aaa ccc ttt gca ctg age age ctg gee 452 
lie Ser Glu Lys lie Ser Thr Lys Pro Phe Ala Leu Ser Ser Leu Ala 

105 110 115 

tec aat gta gca age tct gtt gtt gee gtc att ggc etc ttc etc ttc 500 
Ser Asn Val Ala Ser Ser Val Val Ala Val lie Gly Leu Phe Leu Phe 

120 125 130 

acc tat tgt ctg ata gee ctg ggg agt get ttc cca cac tgt aac tea 548 
Thr Tyr Cys Leu lie Ala Leu Gly Ser Ala Phe Pro His Cys Asn Ser 
135 140 145 

gaa aag aaa ttc ctg tec tta ttg tct tac ttg aaa tec cac cat tgg 596 
Glu Lys Lys Phe Leu Ser Leu Leu Ser Tyr Leu Lys Ser His His Trp 
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150 155 160 



aag aat gaa gac aag aac tgt tac ctg get tat gtc ggt get atg agt 644 
Lys Asn Glu Asp Lys Asn Cys Tyr Leu Ala Tyr Val Gly Ala Met Ser 
165 170 175 180 

get ctg ggg atg atg ctt etc ttc act gtg ctg gag gtt ttc eta get 692 
Ala Leu Gly Met Met Leu Leu Phe Thr Val Leu Glu Val Phe Leu Ala 

185 190 195 

gga tac agt tct ate ttt tgg tgg aag caa gtc tac tec aac aaa cct 740 
Gly Tyr Ser Ser lie Phe Trp Trp Lys Gin Val Tyr Ser Asn Lys Pro 

200 205 210 

999 99^ aca ttt ttc ttg cct caa tea caa gat cat ace caa ctt gtc 788 
Gly Gly Thr Phe Phe Leu Pro Gin Ser Gin Asp His Thr Gin Leu Val 
215 220 225 

aaa age aat ttg ttg caa taa tggatatgaa ataaaaaaga agaaagaaga 83 9 

Lys Ser Asn Leu Leu Gin 
230 

aatctgttgc tcacagcaga gtacagctag acttgectga aatctcagct ttggtagatc 899 
ctaataaaca tctgaagaaa aaaaaaaaaa aaaaaaaaa 93 8 



<210> 34 

<211> 234 

<212> PRT 

<213> Mus musculus 



<400> 34 

Met Arg Leu Gin Leu Gly Thr Lys Asn lie Gly Trp Asp Cys Phe Pro 
15 10 15 



Lys Asp lie lie lie His Lys Arg Glu Lys Thr Gly His Thr Tyr Glu 

20 25 30 



Lys Glu Asp Asp Leu Leu lie Gly Val Pro Ser Glu Ala Thr Leu Leu 
35 40 45 



Gly Thr lie Gin Leu Leu Cys Ala Leu lie Leu Ala Ser Phe Gly Gly 
50 55 60 



lie Leu Val Ser Ala Ser Tyr Phe Asn Pro Glu Val Ser Thr Thr Leu 
65 70 75 80 



lie Ser Gly Tyr Leu Phe lie Gly Ser Leu Cys Phe Ala lie Ala Gly 
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85 90 95 



lie Leu Ser lie lie Ser Glu Lys lie Ser Thr Lys Pro Phe Ala Leu 

100 105 110 



Ser Ser Leu Ala Ser Asn Val Ala Ser Ser Val Val Ala Val lie Gly 
115 120 125 



Leu Phe Leu Phe Thr Tyr Cys Leu lie Ala Leu Gly Ser Ala Phe Pro 
130 135 140 



His Cys Asn Ser Glu Lys Lys Phe Leu Ser Leu Leu Ser Tyr Leu Lys 
145 150 155 160 



Ser His His Trp Lys Asn Glu Asp Lys Asn Cys Tyr Leu Ala Tyr Val 

165 170 175 



Gly Ala Met Ser Ala Leu Gly Met Met Leu Leu Phe Thr Val Leu Glu 

180 185 190 



Val Phe Leu Ala Gly Tyr Ser Ser lie Phe Trp Trp Lys Gin Val Tyr 
195 200 205 



Ser Asn Lys Pro Gly Gly Thr Phe Phe Leu Pro Gin Ser Gin Asp His 
210 215 220 



Thr Gin Leu Val Lys Ser Asn Leu Leu Gin 
225 230 



<210> 35 

<211> 1025 

<212> DNA 

<213> Mus musculus 



<220> 

<221> cDNA 

<222> (1) . . (1025) 



<220> 
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15 



55 



<221> CDS 

<222> (12) . . (818) 



<400> 35 

aattcgtcga c atg gaa ccg gag caa gag aga ctt act tgg caa ccg ggc 50 

Met Glu Pro Glu Gin Glu Arg Leu Thr Trp Gin Pro Gly 
10 15 10 



aca gta age atg aat aca gta act tea ccg ggc ccc atg gec aat tct 98 
Thr Val Ser Met Asn Thr Val Thr Ser Pro Gly Pro Met Ala Asn Ser 
15 20 25 

gta tat gtg gtg gca ccg ccc aac agt tat cct gtg gtc cca gga acc 146 
Val Tyr Val Val Ala Pro Pro Asn Ser Tyr Pro Val Val Pro Gly Thr 
30 35 40 45 

20 gtg cct cag atg cct att tat ccc age aat cag cct caa gtc cat gtg 194 

Val Pro Gin Met Pro lie Tyr Pro Ser Asn Gin Pro Gin Val His Val 

50 55 60 

att tct ggt cat ctg cct ggt ttg gtg cca get atg acc gaa cca cct 242 
25 lie Ser Gly His Leu Pro Gly Leu Val Pro Ala Met Thr Glu Pro Pro 

65 70 75 

gee cag aga gtc ttg aaa aag gga caa gtc eta ggg gee ate cag ate 2 90 

Ala Gin Arg Val Leu Lys Lys Gly Gin Val Leu Gly Ala lie Gin lie 
30 80 85 90 

ctg ate ggc ctg gtg cac ate ggc ctg ggc tec ate atg ate acc aac 33 8 

Leu lie Gly Leu Val His lie Gly Leu Gly Ser lie Met lie Thr Asn 
95 100 105 

35 

etc ttc age cac tat aca cct gtc tec etc tat gga ggt ttc ccc ttc 386 

Leu Phe Ser His Tyr Thr Pro Val Ser Leu Tyr Gly Gly Phe Pro Phe 

110 115 120 125 

40 tgg gga ggc ate tgg ttt ate att tea gga tct etc tea gtg gcg gca 434 

Trp Gly Gly lie Trp Phe lie lie Ser Gly Ser Leu Ser Val Ala Ala 

130 135 140 

gaa act cag ccc aat tea cct tgc ctg ctg aat ggc agt gtg ggc ttg 482 
45 Glu Thr Gin Pro Asn Ser Pro Cys Leu Leu Asn Gly Ser Val Gly Leu 

145 150 155 

aac ate ttc age gee ate tgt tct gcg gtg ggc ate atg etc ttc ate 530 
Asn 'lie Phe Ser Ala lie Cys Ser Ala Val Gly lie Met Leu Phe lie 
50 160 165 170 

aca gac ate agt ate tea tea gga tac ate tac cct age tac tac cct 578 
Thr Asp lie .Ser lie Ser Ser Gly Tyr lie Tyr Pro Ser Tyr Tyr Pro 
175 180 185 



tac cag gag aac ttg ggt gtg aga aca ggc gtg get att tct agt gtg 62 6 

Tyr Gin Glu Asn Leu Gly Val Arg Thr Gly Val Ala lie Ser Ser Val 
190 195 200 205 



60 eta etc ate ttc tgc etc ttg gag etc age att gca agt gtg tec tec 674 

Leu Leu lie Phe Cys Leu Leu Glu Leu Ser lie Ala Ser Val Ser Ser 
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210 215 220 

cac ttt ggc tgc caa gtg gcc tgt tgt cac tac aac aat cca ggt gtg 722 
His Phe Gly Cys Gin Val Ala Cys Cys His Tyr Asn Asn Pro Gly Val 

225 230 235 

gtc att cca aat gtc tat gca gca aac cca gtg gtc ate cca gaa cca 770 
Val lie Pro Asn Val Tyr Ala Ala Asn Pro Val Val lie Pro Glu Pro 
240 245 250 

cca aac cca ata cca agt tat tec gaa gta gtt caa gac tec aga taa 818 
Pro Asn Pro lie Pro Ser Tyr Ser Glu Val Val Gin Asp Ser Arg 
255 260 265 

gtgaacctga agattctaga gaccaagtga catcctctcc tacctagact cctataaacc 878 

aagttcttcc tttcctgacg aagggtaaat atctttcttg tggcctaaat tatagactct 938 

tgcttcaact caccctggaa aaatctctat taaaacgaga tgggagattg aaatggattc 998 

aaataaagat gctctagccg gaaaaaa 1025 

<210> 36 

<211> 268 

<212> PRT 

<213> Mus musculus 



<400> 36 

Met Glu Pro Glu Gin Glu Arg Leu Thr Trp Gin Pro Gly Thr Val Ser 
15 10 15 



Met Asn Thr Val Thr Ser Pro Gly Pro Met Ala Asn Ser Val Tyr Val 

20 25 30 



Val Ala Pro Pro Asn Ser Tyr Pro Val Val Pro Gly Thr Val Pro Gin 
35 40 45 



Met Pro lie Tyr Pro Ser Asn Gin Pro Gin Val His Val lie Ser Gly 
50 55 60 



His Leu Pro Gly Leu Val Pro Ala Met Thr Glu Pro Pro Ala Gin Arg 
65 70 75 80 



Val Leu Lys Lys Gly Gin Val Leu Gly Ala lie Gin lie Leu lie Gly 

85 90 95 



Leu Val His lie Gly Leu Gly Ser lie Met lie Thr Asn Leu Phe Ser 

100 105 110 
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His Tyr Thr Pro Val Ser Leu Tyr Gly Gly Phe Pro Phe Trp Gly Gly 
115 120 125 



lie Trp Phe lie lie Ser Gly Ser Leu Ser Val Ala Ala Glu Thr Gin 
130 135 140 



Pro Asn Ser Pro Cys Leu Leu Asn Gly Ser Val Gly Leu Asn lie Phe 
145 150 155 160 



Ser Ala lie Cys Ser Ala Val Gly lie Met Leu Phe lie Thr Asp lie 

165 170 175 



Ser lie Ser Ser Gly Tyr lie Tyr Pro Ser Tyr Tyr Pro Tyr Gin Glu 

180 185 190 



Asn Leu Gly Val Arg Thr Gly Val Ala lie Ser Ser Val Leu Leu lie 
195 200 205 



Phe Cys Leu Leu Glu Leu Ser lie Ala Ser Val Ser Ser His Phe Gly 
210 215 220 



Cys Gin Val Ala Cys Cys His Tyr Asn Asn Pro Gly Val Val lie Pro 
225 230 235 240 



Asn Val Tyr Ala Ala Asn Pro Val Val lie Pro Glu Pro Pro Asn Pro 

245 250 255 



lie Pro Ser Tyr Ser Glu Val Val Gin Asp Ser Arg 

260 265 



<210> 37 

<211> 1038 

<212> DNA 

<213> Mus musculus 



<220> 

<221> cDNA 

<222> (1) - . (1038) 
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15 



40 



60 



<22 0> 

<221> CDS 

<222> (75) . . (878) 



<400> 37 

10 gctctggatg gtcaggcaca gctgaagcct acaacagagt tcccggggcc agccagggtt 60 



cccggccagg gtca atg get ggc caa get ccc aca gcg gtt ccc ggc agt 110 

Met Ala Gly Gin Ala Pro Thr Ala Val Pro Gly Ser 
15 10 

gtt act gga gaa gtc tea cga tgg cag aac eta gga cct gec cag cct 158 
Val Thr Gly Glu Val Ser Arg Trp Gin Asn Leu Gly Pro Ala Gin Pro 
15 20 25 



20 gca cag aaa gta gec cag ccc caa aac ctg gtt cca gat ggg cac ctt 206 

Ala Gin Lys Val Ala Gin Pro Gin Asn Leu Val Pro Asp Gly His Leu 
30 35 40 

gag aaa gec ctg gag gga agt gac ctt eta cag aag ttg ggt ggc ttt 254 
25 Glu Lys Ala Leu Glu Gly Ser Asp Leu Leu Gin Lys Leu Gly Gly Phe 

45 50 55 60 

cac ate gec att get ttt gca cac ttg gee ttt ggt ggc tac ttg ate 302 
His lie Ala lie Ala Phe Ala His Leu Ala Phe Gly Gly Tyr Leu lie 
30 65 70 75 

tec aca gtc aag aat ctg cac ctg gtg gtg ttg aag tgc tgg tat cca 350 
Ser Thr Val Lys Asn Leu His Leu Val Val Leu Lys Cys Trp Tyr Pro 

80 85 90 

35 

etc tgg gga act gtc tct ttt etc gtt gca ggg atg gcg gee atg aca 398 
Leu Trp Gly Thr Val Ser Phe Leu Val Ala Gly Met Ala Ala Met Thr 
95 100 105 

aca gtg aca ttc cct aag ace tct ctg aag gtt etc tgt gtg ata gee 446 
Thr Val Thr Phe Pro Lys Thr Ser Leu Lys Val Leu Cys Val lie Ala 
110 115 120 

45 aac gtt ate age ttg ttc tgc gca ctg gee ggc ttc ttt gtc att gee 494 

Asn Val lie Ser Leu Phe Cys Ala Leu Ala Gly Phe Phe Val lie Ala 
125 130 135 140 

aag gac etc ttc ctg gag ggt cct ttt cca tgg cca ate tgg aga cca 542 
50 Lys Asp Leu Phe Leu Glu Gly Pro Phe Pro Trp Pro lie Trp Arg Pro 

145 150 155 

tac cct gaa ccc aca ace tac ate caa agg eta gag ctg ace ctg ttc 590 
Tyr Pro Glu Pro Thr Thr Tyr lie Gin Arg Leu Glu Leu Thr Leu Phe 
55 160 165 170 

tgc ttc ace ttc ctg gag ate ttc ctg teg ggg tec aca gee ate aca 638 
Cys Phe Thr Phe Leu Glu lie Phe Leu Ser Gly Ser Thr Ala lie Thr 
175 180 185 



gee tat agg atg aaa cgc ctg caa gca gag gac aag gat gac ace cct 686 
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Ala Tyr Arg Met Lys Arg Leu Gin Ala Glu Asp Lys Asp Asp Thr Pro 
190 195 200 

ttc gtt cct gac aca cct atg gaa ctt aaa ggt eta teg ctg ggg cca 734 
Phe Val Pro Asp Thr Pro Met Glu Leu Lys Gly Leu Ser Leu Gly Pro 
205 210 215 220 

cca cca tec tat aaa gat gtg gec caa gga cac tec tec tct gat act 782 
Pro Pro Ser Tyr Lys Asp Val Ala Gin Gly His Ser Ser Ser Asp Thr 

225 230 235 

ggc aga gee ttg gee acc age tct gga ctg ctg ctg gee tct gac tec 830 
Gly Arg Ala Leu Ala Thr Ser Ser Gly Leu Leu Leu Ala Ser Asp Ser 

240 245 250 

ttc cat caa get ctg ctt cac acg ggc cca agg acc ttg aga aag tag 878 
Phe His Gin Ala Leu Leu His Thr Gly Pro Arg Thr Leu Arg Lys 
255 260 265 

tctaagacta ctagactttc ccaataaaga ctgagttcct geaatgecaa gtccttccct 93 8 

gcctcccatt ggctggactc eggtgatgae acaggcgtga ettgeacaaa taaatgeata 998 

accactatga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 103 8 

<210> 38 

<211> 267 

<212> PRT 

<213> Mus musculus 



<400> 38 

Met Ala Gly Gin Ala Pro Thr Ala Val Pro Gly Ser Val Thr Gly Glu 
1 5 10 15 



Val Ser Arg Trp Gin Asn Leu Gly Pro Ala Gin Pro Ala Gin Lys Val 

20 25 30 



Ala Gin Pro Gin Asn Leu Val Pro Asp Gly His Leu Glu Lys Ala Leu 
35 40 45 



Glu Gly Ser Asp Leu Leu Gin Lys Leu Gly Gly Phe His lie Ala lie 
50 55 60 



Ala Phe Ala His Leu Ala Phe Gly Gly Tyr Leu lie Ser Thr Val Lys 
65 70 75 80 



Asn Leu His Leu Val Val Leu Lys Cys Trp Tyr Pro Leu Trp Gly Thr 

85 90 95 
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Val Ser Phe Leu Val Ala Gly Met Ala Ala Met Thr Thr Val Thr Phe 

100 105 110 



Pro Lys Thr Ser Leu Lys Val Leu Cys Val lie Ala Asn Val lie Ser 
115 120 125 



Leu Phe Cys Ala Leu Ala Gly Phe Phe Val lie Ala Lys Asp Leu Phe 
130 135 140 



Leu Glu Gly Pro Phe Pro Trp Pro lie Trp Arg Pro Tyr Pro Glu Pro 
145 150 155 160 



Thr Thr Tyr lie Gin Arg Leu Glu Leu Thr Leu Phe Cys Phe Thr Phe 

165 170 175 



Leu Glu lie Phe Leu Ser Gly Ser Thr Ala lie Thr Ala Tyr Arg Met 

180 185 190 



Lys Arg Leu Gin Ala Glu Asp Lys Asp Asp Thr Pro Phe Val Pro Asp 
195 200 205 



Thr Pro Met Glu Leu Lys Gly Leu Ser Leu Gly Pro Pro Pro Ser Tyr 
210 215 220 



Lys Asp Val Ala Gin Gly His Ser Ser Ser Asp Thr Gly Arg Ala Leu 
225 230 235 240 



Ala Thr Ser Ser Gly Leu Leu Leu Ala Ser Asp Ser Phe His Gin Ala 

245 250 255 



Leu Leu His Thr Gly Pro Arg Thr Leu Arg Lys 

260 265 



<210> 39 

<211> 1146 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDNA 

<222> (1) . . (1146) 
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<220> 

<221> CDS 

<222> (134) . . (1027) 



<300> 

<301> Tedder, TF, Klejman, G, Schlossman, SF, Saito, H 

<3 02> Structure of the gene encoding the human B lymphocyte 
differentiation antigen CD20 (Bl) 

<303> J Immunol 

<304> 142 

<305> 7 

<306> 2560-2568 

<307> 1989- - 

<309> - - 

<300> 

<301> Tedder, TF, Streuli, M, Schlossman, SF, and Saito H 

<302> Isolation and structure of a cDNA encoding the Bl (CD20) cell 
surface antigen of human B lymphocytes 

<303> Proc Natl Acad Sci USA 

<304> 85 

<305> 1 

<30.6> 208-212 

<307> 1988- - 

<309> 



<400> 39 

cctcaatgac actcatggag gaaatgctga gagaagcatt cagatgcatg acacaaggta 60 

agactgccaa aaatcttgtt cttgctctcc tcattttgtt atttgtttta tttttaggag 120 

ttttgagagc aaa atg aca aca ccc aga aat tea gta aat ggg act ttc 169 

Met Thr Thr Pro Arg Asn Ser Val Asn Gly Thr Phe 
15 10 

ccg gca gag cca atg aaa ggc cct att get atg caa tct ggt cca aaa 217 
Pro Ala Glu Pro Met Lys Gly Pro lie Ala Met Gin Ser Gly Pro Lys 
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15 20 25 

cca etc ttc agg agg atg tct tea ctg gtg ggc ccc acg caa age ttc 265 
Pro Leu Phe Arg Arg Met Ser Ser Leu Val Gly Pro Thr Gin Ser Phe 
30 35 40 

ttc atg agg gaa tct aag act ttg ggg get gtc cag att atg aat ggg 313 
Phe Met Arg Glu Ser Lys Thr Leu Gly Ala Val Gin lie Met Asn Gly 
45 50 55 60 

etc ttc cac att gec ctg ggg ggt ctt ctg atg ate cca gca ggg ate 361 
Leu Phe His lie Ala Leu Gly Gly Leu Leu Met lie Pro Ala Gly lie 

65 70 75 



tat gca ccc 
Tyr Ala Pro 



atg tat att 
Met Tyr lie 
95 

agg aag tgt 
Arg Lys Cys 
110 

ttt get gee 
Phe Ala Ala 
125 

att aaa att 
lie Lys lie 



get cac aca 
Ala His Thr 



tct gag aaa 
Ser Glu Lys 
175 

ctg ttc ttg 
Leu Phe Leu 
190 

gaa ctt gta 
Glu Leu Val 
205 

tec aga ccc 
Ser Arg Pro 



gaa cag act 
Glu Gin Thr 



tct tec caa 



ate tgt gtg 
lie Cys Val 
80 

att tec gga 
lie Ser Gly 



ttg gtc aaa 
Leu Val Lys 



att tct gga 
lie Ser Gly 
130 

tec cat ttt 
Ser His Phe 
145 

cca tat att 
Pro Tyr lie 
160 

aac tec cca 
Asn Ser Pro 



ggc att ttg 
Gly lie Leu 



ata get ggc 
lie Ala Gly 
210 

aaa tct aac 
Lys Ser Asn 
225 

att gaa ata 
lie Glu lie 
240 

cca aag aat 



act gtg tgg 
Thr Val Trp 
85 

tea etc ttg 
Ser Leu Leu 
100 

gga aaa atg 
Gly Lys Met 
115 

atg att ctt 
Met lie Leu 



tta aaa atg 
Leu Lys Met 



aac ata tac 
Asn lie Tyr 
165 

tct acc caa 
Ser Thr Gin 
180 

tea gtg atg 
Ser Val Met 
195 

ate gtt gag 
lie Val Glu 



ata gtt etc 
lie Val Leu 



aaa gaa gaa 
Lys Glu Glu 
245 

gaa gaa gac 



tac cct etc 
Tyr Pro Leu 



gca gca acg 
Ala Ala Thr 



ata atg aat 
lie Met Asn 
120 

tea ate atg 
Ser lie Met 
135 

gag agt ctg 
Glu Ser Leu 
150 

aac tgt gaa 
Asn Cys Glu 



tac tgt tac 
Tyr Cys Tyr 



ctg ate ttt 
Leu lie Phe 
200 

aat gaa tgg 
Asn Glu Trp 
215 

ctg tea gca 
Leu Ser Ala 
230 

gtg gtt ggg 
Val Val Gly 



att gaa att 



tgg gga ggc 
Trp Gly Gly 
90 

gag aaa aac 
Glu Lys Asn 
105 

tea ttg age 
Ser Leu Ser 



gac ata ctt 
Asp lie Leu 



aat ttt att 
Asn Phe lie 
155 

cca get aat 
Pro Ala Asn 
170 

age ata caa 
Ser lie Gin 
185 

gee ttc ttc 
Ala Phe Phe 



aaa aga acg 
Lys Arg Thr 



gaa gaa aaa 
Glu Glu Lys 
235 

eta act gaa 
Leu Thr Glu 
250 

att cca ate 



att 409 
lie 



tct 457 
Ser 



etc 505 
Leu 



aat 553 

Asn 

140 

aga 601 
Arg 



ccc 649 
Pro 



tct 697 
Ser 



cag 745 
Gin 



tgc 793 

Cys 

220 

aaa 841 
Lys 



aca 889 
Thr 



caa 937 
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Ser Ser Gin Pro Lys Asn Glu Glu Asp lie Glu lie lie Pro lie Gin 
255 260 265 

gaa gag gaa gaa gaa gaa aca gag acg aac ttt cca gaa cct ccc caa 985 
Glu Glu Glu Glu Glu Glu Thr Glu Thr Asn Phe Pro Glu Pro Pro Gin 
270 275 280 

gat cag gaa tec tea cca ata gaa aat gac age tct cct taa 1027 
Asp Gin Glu Ser Ser Pro lie Glu Asn Asp Ser Ser Pro 
285 290 295 

gtgatttctt ctgttttctg tttccttttt taaacattag tgttcatagc ttccaagaga 1087 
catgetgact ttcatttctt gaggtactct gcacatacgc accacatctc tatctggee 1146 

<210> 40 

<211> 297 

<212> PRT 

<213> Homo sapiens 



<400> 40 

Met Thr Thr Pro Arg Asn Ser Val Asn Gly Thr Phe Pro Ala Glu Pro 
15 10 15 



Met Lys Gly Pro lie Ala Met Gin Ser Gly Pro Lys Pro Leu Phe Arg 

20 25 30 



Arg Met Ser Ser Leu Val Gly Pro Thr Gin Ser Phe Phe Met Arg Glu 
35 40 45 



Ser Lys Thr Leu Gly Ala Val Gin lie Met Asn Gly Leu Phe His lie 
50 55 60 



Ala Leu Gly Gly Leu Leu Met lie Pro Ala Gly lie Tyr Ala Pro lie 
65 70 75 80 



Cys Val Thr Val Trp Tyr Pro Leu Trp Gly Gly lie Met Tyr lie lie 

85 90 95 



Ser Gly Ser Leu Leu Ala Ala Thr Glu Lys Asn Ser Arg Lys Cys Leu 

100 105 110 



Val Lys Gly Lys Met lie Met Asn Ser Leu Ser Leu Phe Ala Ala lie 
115 120 125 



Ser Gly Met lie Leu Ser lie Met Asp lie Leu Asn lie Lys lie Ser 
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130 135 140 



His Phe Leu Lys Met Glu Ser Leu Asn Phe lie Arg Ala His Thr Pro 
145 150 155 160 



Tyr lie Asn lie Tyr Asn Cys Glu Pro Ala Asn Pro Ser Glu Lys Asn 

165 170 175 



Ser Pro Ser Thr Gin Tyr Cys Tyr Ser lie Gin Ser Leu Phe Leu Gly 

180 185 190 



lie Leu Ser Val Met Leu lie Phe Ala Phe Phe Gin Glu Leu Val lie 
195 200 205 



Ala Gly lie Val Glu Asn Glu Trp Lys Arg Thr Cys Ser Arg Pro Lys 
210 215 220 



Ser Asn lie Val Leu Leu Ser Ala Glu Glu Lys Lys Glu Gin Thr lie 
225 230 235 240 



Glu lie Lys Glu Glu Val Val Gly Leu Thr Glu Thr Ser Ser Gin Pro 

245 250 255 



Lys Asn Glu Glu Asp lie Glu lie lie Pro lie Gin Glu Glu Glu Glu 

260 265 270 



Glu Glu Thr Glu Thr Asn Phe Pro Glu Pro Pro Gin Asp Gin Glu Ser 
275 280 285 



Ser Pro lie Glu Asn Asp Ser Ser Pro 
290 295 



<210> 41 

<211> 894 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDNA 
<222> (1) . . (894) 



-58- 



WO 02/062946 



PCT/US01/48437 



<220> 

<221> CDS 

<222> (51) . . (785) 

<300> 

<301> Kuster, H, Zhang, L, Brini, AT, MacGlashan, DWJ, Kinet, J"-P 

<3 02> The gene and cDNA for the human high affinity immunoglobulin E 
receptor beta chain and expression of the complete human receptor 

<303> J Biol Cham 

<304> 267 

<305> 18 

<306> 12782-12787 

<307> 1992- - 

<309> 



<400> 41 

tctcaatata ataatattct ttattcctgg acagctcggt taatgaaaaa atg gac 56 

Met Asp 
1 

aca gaa agt aat agg aga gca aat ctt get etc cca cag gag cct tec 104 
Thr Glu Ser Asn Arg Arg Ala Asn Leu Ala Leu Pro Gin Glu Pro Ser 
5 10 15 

agt gtg cct gca ttt gaa gtc ttg gaa ata tct ccc cag gaa gta tct 152 
Ser Val Pro Ala Phe Glu Val Leu Glu lie Ser Pro Gin Glu Val Ser 
20 25 30 



tea ggc aga 
Ser Gly Arg 
35 

ctg aca gtt 
Leu Thr Val 



ctg act get 
Leu Thr Ala 



ctt gat att 
Leu Asp lie 
85 

ggt tat cca 



eta ttg aag 
Leu Leu Lys 
40 

ttg aaa aaa 
Leu Lys Lys 
55 

atg ata tgc 
Met lie Cys 
70 

tea cac att 
Ser His lie 



ttc tgg gga 



teg gee tea 
Ser Ala Ser 



gag cag gag 
Glu Gin Glu 



ctt tgt ttt 
Leu Cys Phe 
75 

gag gga gac 
Glu Gly Asp 
90 

gee ata ttt 



tec cca cca 
Ser Pro Pro 
45 

ttc ctg ggg 
Phe Leu Gly 
60 

gga aca gtt 
Gly Thr Val 



att ttt tea 
lie Phe Ser 



ttt tct att 



ctg cat aca 
Leu His Thr 



gta aca caa 
Val Thr Gin 
65 

gtc tgc tct 
Val Cys Ser 
80 

tea ttt aaa 
Ser Phe Lys 
95 

tct gga atg 



tgg 2 0 0 

Trp 

50 

att 248 
lie 



gta 296 
Val 



gca 344 
Ala 



ttg 392 
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Gly Tyr Pro Phe Trp Gly Ala lie Phe Phe Ser lie Ser Gly Met Leu 
100 105 110 

tea att ata tct gaa agg aga aat gca aca tat ctg gtg aga gga age 44 0 

Ser lie lie Ser Glu Arg Arg Asn Ala Thr Tyr Leu Val Arg Gly Ser 
115 120 125 130 

ct 9 99 a 9 ca aac act 9 CC a 9 c a 9 c ata 9 ct 999 99 a ac 9 99 a att acc 488 
Leu Gly Ala Asn Thr Ala Ser Ser lie Ala Gly Gly Thr Gly lie Thr 

135 140 145 

ate ctg ate ate aac ctg aag aag age ttg gee tat ate cac ate cac 536 
lie Leu lie lie Asn Leu Lys Lys Ser Leu Ala Tyr lie His lie His 

150 155 160 

agt tgc cag aaa ttt ttt gag acc aag tgc ttt atg get tec ttt tec 584 
Ser Cys Gin Lys Phe Phe Glu Thr Lys Cys Phe Met Ala Ser Phe Ser 
165 170 175 

act gaa att gta gtg atg atg ctg ttt etc acc att ctg gga ctt ggt 632 
Thr Glu lie Val Val Met Met Leu Phe Leu Thr lie Leu Gly Leu Gly 
180 185 190 

agt get gtg tea etc aca ate tgt gga get ggg gaa gaa etc aaa gga 680 
Ser Ala Val Ser Leu Thr lie Cys Gly Ala Gly Glu Glu Leu Lys Gly 
195 200 205 210 

aac aag gtt cca gag gat cgt gtt tat gaa gaa tta aac ata tat tea 72 8 

Asn Lys Val Pro Glu Asp Arg Val Tyr Glu Glu Leu Asn lie Tyr Ser 

215 220 225 

get act tac agt gag ttg gaa gac cca ggg gaa atg tct cct ccc att 776 
Ala Thr Tyr Ser Glu Leu Glu Asp Pro Gly Glu Met Ser Pro Pro lie 

230 235 240 

gat tta taa gaatcacgtg tccagaacac tctgattcac agecaaggat 825 
Asp Leu 

ccagaaggcc aaggtcttgt taaggggcta ctggaaaaat ttctattctc tccacagcct 885 



gctggtttt 

<210> 42 

<211> 244 

<212> PRT 

<213> Homo sapiens 



<400> . 42 

Met Asp Thr Glu Ser Asn Arg Arg Ala Asn Leu Ala Leu Pro Gin Glu 
15 10 15 



Pro Ser Ser Val Pro Ala Phe Glu Val Leu Glu lie Ser Pro Gin Glu 



894 
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20 25 30 



Val Ser Ser Gly Arg Leu Leu Lys Ser Ala Ser Ser Pro Pro Leu His 
35 40 45 



Thr Trp Leu Thr Val Leu Lys Lys Glu Gin Glu Phe Leu Gly Val Thr 
50 55 60 



Gin lie Leu Thr Ala Met lie Cys Leu Cys Phe Gly Thr Val Val Cys 
65 70 75 80 



Ser Val Leu Asp lie Ser His lie Glu Gly Asp lie Phe Ser Ser Phe 

85 90 95 



Lys Ala Gly Tyr Pro Phe Trp Gly Ala lie Phe Phe Ser lie Ser Gly 

100 105 110 



Met Leu Ser lie lie Ser Glu Arg Arg Asn Ala Thr Tyr Leu Val Arg 
115 120 125 



Gly Ser Leu Gly Ala Asn Thr Ala Ser Ser lie Ala Gly Gly Thr Gly 
130 135 140 



lie Thr lie Leu lie lie Asn Leu Lys Lys Ser Leu Ala Tyr lie His 
145 150 155 160 



lie His Ser Cys Gin Lys Phe Phe Glu Thr Lys Cys Phe Met Ala Ser 

165 170 175 



Phe Ser Thr Glu lie Val Val Met Met Leu Phe Leu Thr lie Leu Gly 

180 185 190 



Leu Gly Ser Ala Val Ser Leu Thr lie Cys Gly Ala Gly Glu Glu Leu 
195 200 205 



Lys Gly Asn Lys Val Pro Glu Asp Arg Val Tyr Glu Glu Leu Asn lie 
210 215 220 



Tyr Ser Ala Thr Tyr Ser Glu Leu Glu Asp Pro Gly Glu Met Ser Pro 
225 230 235 240 



Pro lie Asp Leu 



<210> 43 
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<211> 1646 

<212> DMA 

<213> homo sapiens 

<220> 

<221> cDNA 

<222> (1) . . (1646) 

<220> 

<221> CDS 

<222> (99) . . (743) 

<300> 

<3 01> Adra, CN, Lelias, JM, Kobayashi , H, Kaghad, M, Morrison, P, 
Rowley, JD, Lim, B 

<302> Cloning of the cDNA for a hematopoietic cell - specif ic protein 
related to CD20 and the beta subunit of the high-affinity IgE receptor: 
evidence for a family of proteins with four membrane -spanning regions 

<303> Proc Natl Acad Sci USA 

<304> 91 

<305> 21 

<306> 10178-10182 

<307> 1994- - 

<309> 



<400> 43 

tagtgatctt ttctgagtgt ctcctacttg cgacaaggtg gacttgggag gaaagccgtc 60 

tgccaaagcc tgaagcctcc aagccataaa caacccca atg gcc tec cac gaa gtt 116 

Met Ala Ser His Glu Val 
1 5 

gat aat gca gag ctg ggg tea gcc tct gcc cat ggt acc cca ggc agt 164 
Asp Asn Ala Glu Leu Gly Ser Ala Ser Ala His Gly Thr Pro Gly Ser 

10 15 20 

gag acg gga cca gaa gag ctg aat act tct gtc tac cac ccc ata aat 212 
Glu Thr Gly Pro Glu Glu Leu Asn Thr Ser Val Tyr His Pro lie Asn 
25 30 35 
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gga tea cca gat tat cag aaa gca aaa tta caa gtt ctt ggg gec ate 260 
Gly Ser Pro Asp Tyr Gin Lys Ala Lys Leu Gin Val Leu Gly Ala lie 
40 45 50 

cag ate ctg aat gca gca atg att ctg get ttg ggt gtc ttt ctg ggt 308 
Gin lie Leu Asn Ala Ala Met lie Leu Ala Leu Gly Val Phe Leu Gly 
55 60 65 70 

tec ttg caa tac cca tac cac ttc caa aag cac ttc ttt ttc ttc acc 356 
Ser Leu Gin Tyr Pro Tyr His Phe Gin Lys His Phe Phe Phe Phe Thr 

75 80 85 

ttc tac aca ggc tac ccg att tgg ggt get gtg ttt ttc tgt agt tea 404 
Phe Tyr Thr Gly Tyr Pro lie Trp Gly Ala Val Phe Phe Cys Ser Ser 

90 95 100 

gga acc ttg tct gtt gta gca ggg ata aaa ccc aca aga aca tgg ata 452 
Gly Thr Leu Ser Val Val Ala Gly lie Lys Pro Thr Arg Thr Trp lie 
105 110 115 

cag aac agt ttt gga atg aac att gee agt get aca att gca eta gtg 500 
Gin Asn Ser Phe Gly Met Asn lie Ala Ser Ala Thr lie Ala Leu Val 
120 125 130 

ggg act get ttt etc tea eta aat ata gca gtt aat ate cag tea tta 548 
Gly Thr Ala Phe Leu Ser Leu Asn lie Ala Val Asn lie Gin Ser Leu 
135 140 145 150 

a-99 a gt tgt cac tct tea tea gag tea ccg gac eta tgc aat tac atg 596 
Arg Ser Cys His Ser Ser Ser Glu Ser Pro Asp Leu Cys Asn Tyr Met 

155 160 165 

ggc tec ata tea aat ggc atg gtg tct eta ctg ctg att etc acc ttg 644 
Gly Ser lie Ser Asn Gly Met Val Ser Leu Leu Leu lie Leu Thr Leu 

170 175 180 

ctg gaa tta tgc gta act ate tct acc ata gec atg tgg tgc aat gca 692 
Leu Glu Leu Cys Val Thr lie Ser Thr lie Ala Met Trp Cys Asn Ala 
185 190 195 

aac tgc tgt aat tea aga gag gaa att tec tea cct ccc aat tct gtg 740 
Asn Cys Cys Asn Ser Arg Glu Glu lie Ser Ser Pro Pro Asn Ser Val 
200 205 210 

taa tcaagaatac ctccttatga aaataattct gagagcatga atatttgacc 793 

ttaaatctcc agtgactcag agcttcaccc acaaactcag gagaacataa gcctgctcgt 853 

aaagctcaat ccttctatca tggcaccaat cacaagaacc ttggacgttt gactgactct 913 

atcctttctc tcctaactat aaatcctatt tgtgtgtcgt gggtatggaa ggacagatat 973 

atttctttag gcattcttgg atatctgtaa cttctatgat cattactcca aagttgtttc 1033 

cagaaattgg ttctatttct tcttatccac ctactccatt gctttatgag gtttaaggaa 1093 

ggaaggcggt ataatcccta ttcaatatat tttttctaaa atccaacttc tgaccgccca 1153 

gtaggaagaa aaatgagaca ttttttccat tacagagaaa tgcttcttga ctttaacatc 1213 
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agcattataa 


aaagtgtcaa 


ataaaaaatt 


accatcatta 


tcattaaaat 


aaattttcac 


tgtatttgag 


atgggagggt 


taaggctcag 


ggattttatt 


tcagtgaact 


gctggaactc 


acacatgccc 


tgatatgtaa 


atgatgattt 


atgttggcga 


gtctgagagc 


aagcccaaat 


gtgttcttca 


aaggacaatg 


ggaaactgta 


aagtagagaa 


ctaaagaata 


aggcctttag 


a ss i~ t~* t~ n a r* a r 1 
ctct i— i_ L y d U a L. 




CL CL CL L> I— ' Uy CI CL 


a. v_- L.y L. v., d L» 


C*. 1— l_ CL V — ^ i— ■ y t_ CI 


uyaai^auyyy 


caaattatct 


aatctctctg 


atctattttt 


cctcatctgt 


aaaataggtg 


taataataac 


aactactttg 


tcggttgctc 


tgagggttaa 


atgaaaataa 


aaagaaaatg 


tgaaacagca 


ccacaggtac 


ttg 











<210> 44 

<211> 214 

<212> PRT 

<213> homo sapiens 



<400> 44 

Met Ala Ser His Glu Val Asp Asn Ala Glu Leu Gly Ser Ala Ser Ala 
15 10 15 



His Gly Thr Pro Gly Ser Glu Thr Gly Pro Glu Glu Leu Asn Thr Ser 

20 25 30 



Val Tyr His Pro lie Asn Gly Ser Pro Asp Tyr Gin Lys Ala Lys Leu 
35 40 45 



Gin Val Leu Gly Ala lie Gin lie Leu Asn Ala Ala Met lie Leu Ala 
50 55 60 



Leu Gly Val Phe Leu Gly Ser Leu Gin Tyr Pro Tyr His Phe Gin Lys 
65 70 75 80 



His Phe Phe Phe Phe Thr Phe Tyr Thr Gly Tyr Pro lie Trp Gly Ala 

85 90 95 



Val Phe Phe Cys Ser Ser Gly Thr Leu Ser Val Val Ala Gly lie Lys 

100 105 110 



Pro Thr Arg Thr Trp lie Gin Asn Ser Phe Gly Met Asn lie Ala Ser 
115 120 125 



1273 
1333 
1393 
1453 
1513 
1573 
1633 
1646 
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Ala Thr lie Ala Leu Val Gly Thr Ala Phe Leu Ser Leu Asn lie Ala 
130 135 140 



Val Asn lie Gin Ser Leu Arg Ser Cys His Ser Ser Ser Glu Ser Pro 
145 150 155 160 



Asp Leu Cys Asn Tyr Met Gly Ser lie Ser Asn Gly Met Val Ser Leu 

165 170 175 



Leu Leu lie Leu Thr Leu Leu Glu Leu Cys Val Thr lie Ser Thr lie 

180 185 190 



Ala Met Trp Cys Asn Ala Asn Cys Cys Asn Ser Arg Glu Glu lie Ser 
195 200 205 



Ser Pro Pro Asn Ser Val 
210 



<210> 45 

<211> 1182 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDMA 

<222> (1) . . (1182) 



<220> 

<221> CDS 

<222> (58) . . (861) 



<400> 45 

acacaggttg gagcagagaa agaggaaaca tagaggtgcc aaaggaacaa agacata 57 

atg atg tea tec aag cca aca age cat get gaa gta aat gaa acc ata 105 
Met Met Ser Ser Lys Pro Thr Ser His Ala Glu Val Asn Glu Thr lie 
15 10 15 

ccc aac cct tac cca cca ggc age ttt atg get cct gga ttt caa cag 153 
Pro Asn Pro Tyr Pro Pro Gly Ser Phe Met Ala Pro Gly Phe Gin Gin 

20 25 30 
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cct ctg ggt 
Pro Leu Gly 
35 

get cag ccc 
Ala Gin Pro 
50 

ccg ggt caa 
Pro Gly Gin 
65 

gta atg aac 
Val Met Asn 



atg gtt gga 
Met Val Gly 



tec ttc tct 
Ser Phe Ser 
115 

ggt gga tac 
Gly Gly Tyr 
130 



tea ate aac 
Ser lie Asn 



tac ggc ate 
Tyr Gly lie 



gga aat ata 
Gly Asn lie 
70 

ttt aaa gaa 
Phe Lys Glu 
85 

ttg atg cac 
Leu Met His 
100 

ttt aga gaa 
Phe Arg Glu 



cca ttc tgg 
Pro Phe Trp 



tta gaa aac 
Leu Glu Asn 
40 

aca tct ccg 
Thr Ser Pro 
55 

caa atg ata 
Gin Met lie 



gaa gca aag 
Glu Ala Lys 



att ggt ttt 
lie Gly Phe 
105 

gta tta ggt 
Val Leu Gly 
120 

ggt ggc ctt 
Gly Gly Leu 
135 



caa get cag 
Gin Ala Gin 



gga ate ttt 
Gly lie Phe 
60 

aat cca agt 
Asn Pro Ser 
75 

gca eta ggg 
Ala Leu Gly 
90 

gga att gtt 
Gly lie Val 



ttt gee tct 
Phe Ala Ser 



tct ttt att 
Ser Phe lie 
140 



ggt get cag 
Gly Ala Gin 
45 

get age agt 
Ala Ser Ser 



gtg gga aca 
Val Gly Thr 



gtg ate cag 
Val lie Gin 
95 

ttg tgt tta 
Leu Cys Leu 
110 

act get gtt 
Thr Ala Val 
125 

ate tct ggc 
lie Ser Gly 



cgt 2 01 

Arg 



caa 249 
Gin 



gca 297 

Ala 

80 

ate 345 
He 



ata 393 
He 



att 441 
He 



tct 489 
Ser 



etc tct gtg tea gca tec aag gag ctt tec cgt tgt ctg gtg aaa ggc 537 
Leu Ser Val Ser Ala Ser Lys Glu Leu Ser Arg Cys Leu Val Lys Gly 
145 L50 155 160 

age ctg gga atg aac att gtt agt tct ate ttg gee ttc att gga gtg 585 
Ser Leu Gly Met Asn He Val Ser Ser He Leu Ala Phe He Gly Val 

165 170 175 

att ctg ctg ctg gtg gat atg tgc ate aat ggg gta get ggc caa gac 633 
He Leu Leu Leu Val Asp Met Cys He Asn Gly Val Ala Gly Gin Asp 

180 185 190 

tac tgg gee gtg ctt tct gga aaa ggc att tea gee acg ctg atg ate 681 
Tyr Trp Ala Val Leu Ser Gly Lys Gly He Ser Ala Thr Leu Met He 
195 200 205 

ttc tec etc ttg gag ttc ttc gta get tgt gee aca gee cat ttt gee 729 
Phe Ser Leu Leu Glu Phe Phe Val Ala Cys Ala Thr Ala His Phe Ala 
210 215 220 

aac caa gca aac ace aca ace aat atg tct gtc ctg gtt att cca aat 777 
Asn Gin Ala Asn Thr Thr Thr Asn Met Ser Val Leu Val He Pro Asn 
225 230 235 240 

atg tat gaa age aac cct gtg aca cca gcg tct tct tea get cct ccc 825 
Met Tyr Glu Ser Asn Pro Val Thr Pro Ala Ser Ser Ser Ala Pro Pro 

245 250 255 

aga tgc aac aac tac tea get aat gee cct aaa tag taaaagaaaa 871 
Arg Cys Asn Asn Tyr Ser Ala Asn Ala Pro Lys 

260 265 
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aggggtatca gtctaatctc atggagaaaa actacttgca aaaacttctt aagaagatgt 931 

cttttattgt ctacaatgat ttctagtctt taaaaactgt gtttgagatt tgtttttagg 991 

ttggtcgcta atgatggctg tatctccctt cactgtctct tcctacatta ccactactac 1051 

atgctggcaa aggtgaagga tcagaggact gaaaaatgat tctgcaactc tcttaaagtt 1111 

agaaatgttt ctgttcatat tactttttcc ttaataaaat gtcattagaa acaaaaaaaa 1171 

aaaaaaaaaa a 1182 



<210> 46 

<211> 267 

<212> PRT 

<213> Homo sapiens 



<400> 46 

Met Met Ser Ser Lys Pro Thr Ser His Ala Glu Val Asn Glu Thr lie 
15 10 15 



Pro Asn Pro Tyr Pro Pro Gly Ser Phe Met Ala Pro Gly Phe Gin Gin 

20 25 30 



Pro Leu Gly Ser lie Asn Leu Glu Asn Gin Ala Gin Gly Ala Gin Arg 
35 40 45 



Ala Gin Pro Tyr Gly lie Thr Ser Pro Gly lie Phe Ala Ser Ser Gin 
50 55 60 



Pro Gly Gin Gly Asn lie Gin Met lie Asn Pro Ser Val Gly Thr Ala 
65 70 75 80 



Val Met Asn Phe Lys Glu Glu Ala Lys Ala Leu Gly Val lie Gin He 

85 90 95 



Met Val Gly Leu Met His He Gly Phe Gly He Val Leu Cys Leu He 

100 105 110 



Ser Phe Ser Phe Arg Glu Val Leu Gly Phe Ala Ser Thr Ala Val He 
115 120 125 



Gly Gly Tyr Pro Phe Trp Gly Gly Leu Ser Phe He He Ser Gly Ser 
130 135 140 
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Leu Ser Val Ser Ala Ser Lys Glu Leu Ser Arg Cys Leu Val Lys Gly 
145 150 155 160 



Ser Leu Gly Met Asn lie Val Ser Ser lie Leu Ala Phe lie Gly Val 

165 170 175 



lie Leu Leu Leu Val Asp Met Cys lie Asn Gly Val Ala Gly Gin Asp 

180 185 190 



Tyr Trp Ala Val Leu Ser Gly Lys Gly lie Ser Ala Thr Leu Met lie 
195 200 205 



Phe Ser Leu Leu Glu Phe Phe Val Ala Cys Ala Thr Ala His Phe Ala 
210 215 220 



Asn Gin Ala Asn Thr Thr Thr Asn Met Ser Val Leu Val lie Pro Asn 
225 230 235 240 



Met Tyr Glu Ser Asn Pro Val Thr Pro Ala Ser Ser Ser Ala Pro Pro 

245 250 255 



Arg Cys Asn Asn Tyr Ser Ala Asn Ala Pro Lys 

260 265 



<210> 47 

<211> 1388 

<212> DNA 

<213> Mus musculus 



<220> 

<221> CDNA 

<222> (1) . . (1388) 



<220> 

<221> CDS 

<222> (178) . . (1053) 



-68- 



WO 02/062946 



PCT/US01/48437 



<300> 

<301> Tedder, TF , Klejman, G, Disteche, CM, Adler, DA, Schlossman, SF, 
Saito, H 

<302> Cloning of complementary DNA encoding a new mouse B lymphocyte 
differentiation antigen, homologous to the human Bl (CD20) antigen and 
localization of the gen to chromosome 19 

<3 03> J Immunol 

<304> 141 

<305> 12 

<306> 4388-4394 

<307> 1988- - 

<309> 



<400> 47 

gaattccttt tttttttttt tttttaaaga tttatttatt attatatgta agtacactgt 60 

agctatcttc aagtacttga gatagaagag gccaactgat ctcagctgtg agtggctaat 120 

ttggccctta agccttggag ccttggagcc ttggagaccc aggcgtttga aaactca 177 

atg agt gga cct ttc cca gca gag cct aca aaa ggt ccc etc gec atg 225 
Met Ser Gly Pro Phe Pro Ala Glu Pro Thr Lys Gly Pro Leu Ala Met 
15 10 15 

caa cct get cca aaa gtg aac etc aaa agg aca tct tea ctg gtg ggc 273 
Gin Pro Ala Pro Lys Val Asn Leu Lys Arg Thr Ser Ser Leu Val Gly 

20 25 30 

ccc aca caa age ttc ttc atg agg gaa tea aag get ttg ggg get gtc 321 
Pro Thr Gin Ser Phe Phe Met Arg Glu Ser Lys Ala Leu Gly Ala Val 
35 40 45 

caa ate atg aat ggc etc ttc cat att acc ctg ggg gga ctg ctg atg 369 
Gin lie Met Asn Gly Leu Phe His lie Thr Leu Gly Gly Leu Leu Met 
50 55 60 

ate ccc aca ggg gtc ttc gca ccc ate tgt ttg agt gta tgg tac cct 417 
lie Pro Thr Gly Val Phe Ala Pro lie Cys Leu Ser Val Trp Tyr Pro 
65 70 75 80 

etc tgg gga ggc att atg tac att att tea gga tea etc ctg gca get 465 
Leu Trp Gly Gly lie Met Tyr lie lie Ser Gly Ser Leu Leu Ala Ala 

85 90 95 

gca gca gaa aaa acc tec agg aag agt ttg gtc aaa gca aaa gtg ata 513 
Ala Ala Glu Lys Thr Ser Arg Lys Ser Leu Val Lys Ala Lys Val lie 

100 105 110 

atg age tct eta age etc ttt get gee att tct gga ata att ctt tea 561 
Met Ser Ser Leu Ser Leu Phe Ala Ala lie Ser Gly lie lie Leu Ser 
115 120 125 
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ate atg gac ata ctt aac atg aca ctt tct cat ttt tta aaa atg aga 609 
lie Met Asp lie Leu Asn Met Thr Leu Ser His Phe Leu Lys Met Arg 
130 135 140 

aga ctg gag ctt att caa act tec aag ccg tat gtt gat ate tac gac 657 
Arg Leu Glu Leu He Gin Thr Ser Lys Pro Tyr Val Asp He Tyr Asp 
145 150 155 160 

tgt gaa cca tct aat tec tea gag aaa aac tec cca tct aca cag tac 705 
Cys Glu Pro Ser Asn Ser Ser Glu Lys Asn Ser Pro Ser Thr Gin Tyr 

165 170 175 

tgt aac age att cag tct gtg ttc ttg ggc att ctg teg gcg atg ctg 753 
Cys Asn Ser He Gin Ser Val Phe Leu Gly He Leu Ser Ala Met Leu 

180 185 190 

ate tct gee ttc ttc cag aaa ctt gtg aca get ggt att gtg gag aat 801 
He Ser Ala Phe Phe Gin Lys Leu Val Thr Ala Gly He Val Glu Asn 
195 200 205 

gag tgg aaa aga atg tgt acc aga tec aaa tct aat gtg gtt ctg ctg 84 9 

Glu Trp Lys Arg Met Cys Thr Arg Ser Lys Ser Asn Val Val Leu Leu 
210 215 220 

tea get gga gaa aaa aat gag cag acg att aaa atg aaa gaa gaa ate 897 
Ser Ala Gly Glu Lys Asn Glu Gin Thr He Lys Met Lys Glu Glu He 
225 230 235 240 

att gag eta agt gga gta tct tec caa cca aag aat gaa gag gaa att 945 
He Glu Leu Ser Gly Val Ser Ser Gin Pro Lys Asn Glu Glu Glu He 

245 250 255 

gaa att att cca gtg cag gag gaa gaa gaa gaa gaa gca gaa ata aat 993 
Glu He He Pro Val Gin Glu Glu Glu Glu Glu Glu Ala Glu He Asn 

260 265 270 

ttt cca gca cct ccc caa gag cag gaa tec ttg cca gtg gaa aat gag 1041 
Phe Pro Ala Pro Pro Gin Glu Gin Glu Ser Leu Pro Val Glu Asn Glu 
275 280 285 

ate get cct taa actcttttct tttctaagca ttattgttta gagagcttcc 1093 
He Ala Pro 
290 

aagacacata gttaccctca tctcttgtgg ccttccacaa tctattctcc atattttcac 1153 

agcttaactt tgcatagaga agccacatct agctctcctt cacatttgaa gaatgcagtg 1213 

attataaaag attgtctttt gecttgetta gggagtctta cactggcaga aacgetgaag 1273 

aatccaattc tcattcacct tttccttgga tgtgtgtctc agtagtggta atggtttttc 1333 

cgcatttcct ccatcagcag ttacagcaga aggagtcaga gagttcaggg aattc 1388 

<210> 48 
<211> 291 
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<212> PRT 

<213> Mus musculus 



<400> 48 

Met Ser Gly Pro Phe Pro Ala Glu Pro Thr Lys Gly Pro Leu Ala Met 
15 10 15 



Gin Pro Ala Pro Lys Val Asn Leu Lys Arg Thr Ser Ser Leu Val Gly 

20 25 30 



Pro Thr Gin Ser Phe Phe Met Arg Glu Ser Lys Ala Leu Gly Ala Val 
35 40 45 



Gin lie Met Asn Gly Leu Phe His lie Thr Leu Gly Gly Leu Leu Met 
50 55 60 



lie Pro Thr Gly Val Phe Ala Pro lie Cys Leu Ser Val Trp Tyr Pro 
65 70 75 80 



Leu Trp Gly Gly lie Met Tyr lie lie Ser Gly Ser Leu Leu Ala Ala 

85 90 95 



Ala Ala Glu Lys Thr Ser Arg Lys Ser Leu Val Lys Ala Lys Val lie 

100 105 110 



Met Ser Ser Leu Ser Leu Phe Ala Ala lie Ser Gly lie lie Leu Ser 
115 120 125 



lie Met Asp lie Leu Asn Met Thr Leu Ser His Phe Leu Lys Met Arg 
130 135 140 



Arg Leu Glu Leu lie Gin Thr Ser Lys Pro Tyr Val Asp lie Tyr Asp 
145 150 155 160 



Cys Glu Pro Ser Asn Ser Ser Glu Lys Asn Ser Pro Ser Thr Gin Tyr 

165 170 175 



Cys Asn Ser lie Gin Ser Val Phe Leu Gly lie Leu Ser Ala Met Leu 

180 185 190 



lie Ser Ala Phe Phe Gin Lys Leu Val Thr Ala Gly lie Val Glu Asn 
195 200 205 



Glu Trp Lys Arg Met Cys Thr Arg Ser Lys Ser Asn Val Val Leu Leu 
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210 215 220 



Ser Ala Gly Glu Lys Asn Glu Gin Thr lie Lys Met Lys Glu Glu lie 
225 230 235 240 



lie Glu Leu Ser Gly Val Ser Ser Gin Pro Lys Asn Glu Glu Glu lie 

245 250 255 



Glu lie lie Pro Val Gin Glu Glu Glu Glu Glu Glu Ala Glu lie Asn 

260 265 270 



Phe Pro Ala Pro Pro Gin Glu Gin Glu Ser Leu Pro Val Glu Asn Glu 
275 280 285 



lie Ala Pro 
290 



<210> 49 

<211> 1010 

<212> DMA 

<213> Mus musculus 



<220> 

<221> cDNA 

<222> (1) . . (1010) 



<220> 

<221> CDS 

<222> (23) . . (730) 



<300> 

<301> Ra, C, Jouvin, M-He, Kinet, J-P 

<302> Complete structure of the mouse mast cell receptor for IgE 
(FceRI) and surface expression of chimeric receptors (rat -mouse -human) 
on transfected cells 

<303> J Biol Chem 

<304> 264 
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10 



15 



35 



55 



<305> 26 

<306> 15323-15327 

<307> 1989- - 

<309> 



<400> 49 

ggatagccca attaatgaaa aa atg gac aca gaa aat agg age aga gca gat 52 

Met Asp Thr Glu Asn Arg Ser Arg Ala Asp 
15 10 

ctt get etc cca aat cca caa gaa tec tec agt gca cct gac att gaa 100 
Leu Ala Leu Pro Asn Pro Gin Glu Ser Ser Ser Ala Pro Asp lie Glu 

15 20 25 



20 etc ttg gaa gca tct cct gee aaa gca gee cca cca aag cag aca tgg 148 

Leu Leu Glu Ala Ser Pro Ala Lys Ala Ala Pro Pro Lys Gin Thr Trp 

30 35 40 

egg aca ttt ttg aag aaa gag ttg gag ttc ctg gga gca aca caa att 196 
25 Arg Thr Phe Leu Lys Lys Glu Leu Glu Phe Leu Gly Ala Thr Gin lie 

45 50 55 

ct 9 gtt 99 1 fct 9 ata t9 c ctt tgt ttt gga aca att gtc tgc tec gta 244 
Leu Val Gly Leu lie Cys Leu Cys Phe Gly Thr lie Val Cys Ser Val 
30 60 65 70 

etc tat gtt tea gac ttt gat gaa gaa gtg ctt tta ctt tat aaa eta 292 
Leu Tyr Val Ser Asp Phe Asp Glu Glu Val Leu Leu Leu Tyr Lys Leu 



75 80 85 90 

ggc tat cca ttc tgg ggt gca gtg ctg ttt gtt ttg tct gga ttt ttg 340 
Gly Tyr Pro Phe Trp Gly Ala Val Leu Phe Val Leu Ser Gly Phe Leu 

95 100 105 



40 tea att ate tec gaa aga aaa aac aca ttg tat ctg gtg aga ggc age 3 88 

Ser lie lie Ser Glu Arg Lys Asn Thr Leu Tyr Leu Val Arg Gly Ser 

110 115 120 

ctg 99 a gca aac att gtc agt age ate get gca ggg acg ggg ate gee 43 6 

45 Leu Gly Ala Asn lie Val Ser Ser lie Ala Ala Gly Thr Gly lie Ala 

125 130 135 

atg ctg ate etc aat ctg ace aat aac ttc get tat atg aac aac tgc 484 

Met Leu lie Leu Asn Leu Thr Asn Asn Phe Ala Tyr Met Asn Asn Cys 
50 140 145 150 

aag aat gta ace gaa gac gac ggc tgc ttt gtg get tct ttc ace aca 532 

Lys Asn Val Thr Glu Asp Asp Gly Cys Phe Val Ala Ser Phe Thr Thr 



155 160 165 170 

gaa ctg gtg ttg atg atg ctg ttt etc ace ate ctg gee ttt tgc agt 580 
Glu Leu Val Leu Met Met Leu Phe Leu Thr lie Leu Ala Phe Cys Ser 

175 180 185 



60 get gtg ttg ttc act ate tat agg att gga caa gag tta gaa agt aaa 62 8 

Ala Val Leu Phe Thr lie Tyr Arg lie Gly Gin Glu Leu Glu Ser Lys 
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190 195 200 

aag gtc cca gat gat cgt ctt tat gaa gaa tta aat gtg tat tea cca 676 

Lys Val Pro Asp Asp Arg Leu Tyr Glu Glu Leu Asn Val Tyr Ser Pro 
205 210 215 

att tac agt gag ttg gaa gac aaa ggg gaa aca tct tct cca gtt gat 724 

lie Tyr Ser Glu Leu Glu Asp Lys Gly Glu Thr Ser Ser Pro Val Asp 

220 225 230 

tea taa gaatcagggg accaggacaa tctgattcaa gtataatctt gaaagttgat 780 

Ser 

235 

ctttttacaa aattctcgea aaatttctgt ttgttccaca ttctgtcagt ttttcaattg 840 

gattgttctg cagatgccac tcttttagtt atgctgtatc tgatcttcta aatatctccc 900 

tttttgcgga tatcattcac tccaattttc ttgttttgtg tcacaatttc acatacatct 960 

tttctggaaa gtcatcaagg aataagttgg ctttattgta tgtctacttt 1010 



<210> 50 

<211> 235 

<212> PRT 

<213> Mus musculus 



<400> 50 

Met Asp Thr Glu Asn Arg Ser Arg Ala Asp Leu Ala Leu Pro Asn Pro 
15 10 15 



Gin Glu Ser Ser Ser Ala Pro Asp lie Glu Leu Leu Glu Ala Ser Pro 

20 25 30 



Ala Lys Ala Ala Pro Pro Lys Gin Thr Trp Arg Thr Phe Leu Lys Lys 
35 40 45 



Glu Leu Glu Phe Leu Gly Ala Thr Gin lie Leu Val Gly Leu He Cys 
50 55 60 



Leu Cys Phe Gly Thr He Val Cys Ser Val Leu Tyr Val Ser Asp Phe 
65 70 75 80 



Asp Glu Glu Val Leu Leu Leu Tyr Lys Leu Gly Tyr Pro Phe Trp Gly 

85 90 95 



Ala Val Leu Phe Val Leu Ser Gly Phe Leu Ser He He Ser Glu Arg 

100 105 110 
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Lys Asn Thr Leu Tyr Leu Val Arg Gly Ser Leu Gly Ala Asn lie Val 
115 120 125 



Ser Ser lie Ala Ala Gly Thr Gly lie Ala Met Leu lie Leu Asn Leu 
130 135 140 



Thr Asn Asn Phe Ala Tyr Met Asn Asn Cys Lys Asn Val Thr Glu Asp 
145 150 155 160 



Asp Gly Cys Phe Val Ala Ser Phe Thr Thr Glu Leu Val Leu Met Met 

165 170 175 



Leu Phe Leu Thr lie Leu Ala Phe Cys Ser Ala Val Leu Phe Thr lie 

180 185 190 



Tyr Arg lie Gly Gin Glu Leu Glu Ser Lys Lys Val Pro Asp Asp Arg 
195 200 205 



Leu Tyr Glu Glu Leu Asn Val Tyr Ser Pro lie Tyr Ser Glu Leu Glu 
210 215 220 



Asp Lys Gly Glu Thr Ser Ser Pro Val Asp Ser 
225 230 235 



<210> 51 

<211> 2542 

<212> DNA 

<213> rat 



<220> 

<221> CDNA 

<222> (1) . . (2542) 



<400> 51 

acgtttctgt gtaacaatat cttttattcc tggatagtcc aattaatgaa aaaaatggac 60 

acagaaaata agagcagagc agatcttgct ctcccaaacc cacaagaatc ccccagcgca 12 0 

cctgacattg aactcttgga agcgtcccct cctgcaaaag ctctaccaga gaagccagcc 180 

tcacccccac cacagcagac atggcagtca tttttgaaga aagagttgga gttcctgggc 240 
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gtaacccaag 


ttctggttgg 


tttgatatgc 


5 


ctccagactt 


cagactttga 


cgacgaagtg 


tggggtgcag 


tgctgtttgt 


tttgtctgga 




a a f i~ f~r t~ a +* r"> 
clv^ at l_y UaLL 


cyy ty ciy ciyy 


Lay L.L. tyyy a 


10 


ttggggatcg 


ccatattgat 


tctcaatctg 




aaggatataa 


ccgaagacga 


tggttgcttc 


15 


atgttgctgt 


ttctcaccat 


cctggccttt 




attggacaag 


aatttgagcg 


tagtaaggtc 




y y La l LLaL 


v_ d ci u L- L- ex cty 


L-y ^ y i-i-yycici 


20 


tcataagaat 


caaggggcca 


ggacaatctg 




ttgcaacatt 


atggcaacat 


ttctgtttcc 


25 


gttctgtaga 


tacccctgtt 


tcagttatga 




gtggatatcg 


ttcaatccag 


ttttcttgtt 




y y ct ci cty l l. a. l 


^cxctcictci^ctciy 


rhsnrh t~ t~ t~ ^ 

v^L.dyv^.L.^U-l—Cl 


30 


ggaggagtta 


ttttgagagt 


ttaactaaac 




ttcattttag 


aattctcaac 


aatacttgtg 


35 


tatatatatt 


taatatggta 


tttacttttc 




aatgggtttt 


tcagagacat 


aaagctttat 




d l— CL U CI l~ d 


3l~3.f"'3cr1~1 - cit" 

a i— . gl\_* ciy i— ^y *— 


i~y i— "-y uyy 


40 


gagaacatta 


ttaaatcatt 


cagtgtagtg 




agtgtgtata 


tcttttggta 


agacatgtgt 


45 


agctggaaag 


tagtcaatgg 


gaatgacaag 




taactaccaa 


caattggtag 


agattgccat 




y ^ ll l y L» CL 


v*. cl ^ q. y i— y c*. y i— ■ 


uciy ciy y Vw v_ ciy 


50 


acgaaagaag 


atgttggttc 


aaactgttaa 




tttgactgat 


atattgagtg 


cttgagtttt 


55 


gaggggagca 


gtgatgcaga 


agtctggatg 




tgaaacgtat 


ttccagactt 


ttgttctgag 




agaaggtcaa 


gacaccaaaa 


gaattatttt 


60 


tgcctatact 


tagaaggtga 


agtaaggatc 



ctttgttttg 


gaacagttgt 


ctgctccaca 


300 


cttttattat 


atagagcagg 


ctacccattc 


360 


tttttgtcaa 


ttatgtccga 


aaggaaaaac 


420 


gcaaacattg 


tcagcagcat 


cgctgcaggc 


480 


agcaacaact 


ccgcttatat 


gaactactgc 


540 


gtgacttctt 


tcatcacaga 


actggtgttg 


600 


tgcagtgccg 


tgctgctcat 


tatctatagg 


660 


cccgatgacc 


gtctctatga 


agaattacat 


720 


gacacaaggg 


aagcgtccgc 


accagtggtt 


780 


attccagtat 


agtcttgaga 


gtcgatcttt 


840 


tccgcactct 


atcaactttt 


caattggatt 


900 


tgcctctggt 


ctttaattat 


ctcccttttt 


960 


ttgtgtcaca 


gtctcacata 


caacctttct 


1020 


ttgcatgtct 


actttcatga 


acaaaaggaa 


1080 


ttagataatc 


aggtaatatt 


tgactcttag 


1140 


catgatatat 


gcccaccata 


tcaagccttc 


1200 


tatgtagata 


gattttccac 


cctcaataat 


1260 


gaaaagacac 


atattatcta 


attcatgggt 


1320 


tgtttactac 


ttggtgggta 


gtaggtaata 


1380 


agatgcatag 


gtaaaatcag 


ggacactgtg 


1440 


gaaaatgaag 


aataaactga 


tgaagacttg 


1500 


aaatgattgt 


gtataacact 


tgtagataaa 


1560 


gtatgcctaa 


aatctcccag 


cccaaggcca 


1620 


tctgggctac 


acaagatcat 


acatcaaagg 


1680 


cacagtaagg 


gatatttaaa 


caaacagaag 


1740 


taataaaact 


gaatgaataa 


cattgcgggg 


1800 


atggaggagt 


agcagaatca 


gatgaaacat 


1860 


atggttataa 


gagcaatcac 


cattaaatga 


1920 


gagatagaat 


taagacagtc 


aaaatccaca 


1980 


aaaagtagaa 


agcctaacga 


ttagttggaa 


2040 
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aagcatatta 


cgttaggcag 


cagatgtcta 


tagtggagaa 


aagttaaaca 


aggagaaata 


atgaaccacc 


agagactcta 


catgttggtt 


tgggaaataa 


gagaaaatag 


caattctaaa 


cgaatgcaaa 


ctctgaagaa 


gcatttccca 


aagggtgtgg 


gcagaggacc 


agaacatttg 


caaatgtacc 


tagagagcaa 


acctgaatag 


gaggtaaaat 




cagctaagaa 


aatgattttg 


ttgctgttat 


ttagatttta 


aaagaaacaa 


aaagagtcat 


taaaaatctg 


tttactaaaa 


t cacrt t at ta 


tcrttctctcrt 


crtatqtccaa 


agtacaggt a 


acttttc taa 


atcttcctgt 


aaggctcacc 


tcatatgtct 


cttcacatag 


ccacaccctt 


gattcacagt 


tactctacca 


cagtagtaaa 


ctgtgcttgt 


ggtctcctta 


tgtatcttcc 


actagtgttt 


ataaaataaa 


tcagaattat 


tt 









<210> 52 

<211> 243 

<212> PRT 

<213> rat 



<400> 52 

Met Asp Thr Glu Asn Lys Ser Arg Ala Asp Leu Ala Leu Pro Asn Pro 
15 10 15 

Gin Glu Ser Pro Ser Ala Pro Asp lie Glu Leu Leu Glu Ala Ser Pro 

20 25 30 

Pro Ala Lys Ala Leu Pro Glu Lys Pro Ala Ser Pro Pro Pro Gin Gin 
35 40 45 

Thr Trp Gin Ser Phe Leu Lys Lys Glu Leu Glu Phe Leu Gly Val Thr 
50 55 60 

Gin Val Leu Val Gly Leu lie Cys Leu Cys Phe Gly Thr Val Val Cys 
65 70 75 80 

Ser Thr Leu Gin Thr Ser Asp Phe Asp Asp Glu Val Leu Leu Leu Tyr 

85 90 95 

Arg Ala Gly Tyr Pro Phe Trp Gly Ala Val Leu Phe Val Leu Ser Gly 

100 105 110 

Phe Leu Ser lie Met Ser Glu Arg Lys Asn Thr Leu Tyr Leu Val Arg 
115 120 125 

Gly Ser Leu Gly Ala Asn lie Val Ser Ser lie Ala Ala Gly Leu Gly 
130 135 140 

lie Ala lie Leu lie Leu Asn Leu Ser Asn Asn Ser Ala Tyr Met Asn 
145 150 155 160 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2542 
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Tyr Cys Lys Asp lie Thr Glu Asp Asp Gly Cys Phe Val Thr Ser Phe 

165 170 175 

lie Thr Glu Leu Val Leu Met Leu Leu Phe Leu Thr lie Leu Ala Phe 

180 185 190 

Cys Ser Ala Val Leu Leu lie lie Tyr Arg lie Gly Gin Glu Phe Glu 
195 200 205 

Arg Ser Lys Val Pro Asp Asp Arg Leu Tyr Glu Glu Leu His Val Tyr 
210 215 220 

Ser Pro lie Tyr Ser Ala Leu Glu Asp Thr Arg Glu Ala Ser Ala Pro 
225 230 235 240 

Val Val Ser 

<210> 53 

<211> 1323 

<212> DNA 

<213> dog 

<220> 

<221> cDNA 

<222> (1) . . (1323) 



<400> 53 

cccagctcac tgcaccaacg tccttccctg gccgacagct cggctaatga gaaaatggag 60 

ccagaaaaca gtggcagggc aggccttgct ctcccgaacc cgcagggacc ctccagtgca 120 

cctgaaattg aactttcaga agtacctctc cgtgaaaacc ccttactgga gaaggctggt 180 

tcatccccac cgtgccatac gtggctgatg tttctgaaga gggagctgga atttctgggg 240 

gtaacacaaa ttctgattgc cttggtatgt ctttgttttg gaataattgt ctgctccgtg 3 00 

atcaatattt cagaatttaa ggaagacatt ttttcatcat ttaaagcagg ctacccattc 360 

tggggagcag tgttttttgc tatttctgga tttttgccaa ttatgtctga aaagaaacat 420 

gctacatatc tggcgtgggg cagcctggga gccaacactg tccagatagc tgcgggaata 4 80 

ggaatcttta tctacggtca acctgcagac ctgcagagga gctcggctta catccacaac 540 

tgccagcaag ctcccagaga cgacttctgc tttgtggctt gtttttccac agaaattgtg 600 

gcaatgatcc tgtttctcac tattctgggg ttttgcagtg ctgtgtcact gacgatctat 660 

ggagttggag aattagtcca agggaacaag attgtagaag atcgtcttta tgaagaatta 72 0 
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aacatatatt 


caccgattta 


cagtgagctg 


gaagacaaag 


cggaggaatc 


ttctcctgtt 


gattcataag 


aattgtgttc 


agaacattct 


cacagaaaag 


ggtctagaat 


aagccaagat 


ctttgttaaa 


ggggctacag 


gcaaaatttt 


gattcttgtc 


tatgatctgc 


tagttttaca 


ctaggattta 


ttctccaaat 


ggtatatgct 


gcctgttgtt 


tctgttcacc 


taaaattccc 


ctcccaccac 


ccacttatgg 


gtatgtgata 


gactcctatt 


ttcttgtttc 


tattgctctc 


tggtagcact 


ttatggaagt 


cacgtgtaat 


gagcaagcat 


ttattctgtg 


cctataactg 




y y y aaLacty u 


yy ciciyyyy ul 




t~'t - ;^;^;=ar"' , 1 - 't~1 - ^'~ , 
i— l, aciau i— i— L- v — 




cacctgggtg 


gatccgtcaa 


ttaagagtct 


gactcttgac 


tgcagactca 


ggtcatgaga 


tcaagcccca 


tgtcaggctc 


ctcactcagt 


ggggagtctg 


cttgggattc 


tttctctcct 


tctccctcta 


tcccttcttc 


tgctctctgt 


ctcccaatct 


cataaataaa 


taaatataat 



ttt 

<210> 54 

<211> 137 

<212> PRT 

<213> dog 



<400> 54 

Met Asn Ser Gly Arg Ala Gly Ala Asn Gly Ser Ser Ala Ser Val Arg 
1 5 ' 10 15 

Asn Lys Ala Gly Ser Ser Cys His Thr Trp Met Lys Arg Gly Val Thr 

20 25 30 

Ala Val Cys Cys Gly Val Cys Ser Val Asn Ser Lys Asp Ser Ser Lys 
35 40 45 

Ala Gly Tyr Trp Gly Ala Val Ala Ser Gly Met Ser Lys Lys His Ala 
50 55 60 

Thr Tyr Ala Trp Gly Ser Gly Ala Asn Thr Val Ala Ala Gly Gly Tyr 
65 70 75 80 

Gly Ala Asp Arg Ser Ser Ala Tyr His Asn Cys Ala Arg Asp Asp Cys 

85 90 95 

Val Ala Cys Ser Thr Val Ala Met Thr Gly Cys Ser Ala Val Ser Thr 

100 105 110 

Tyr Gly Val Gly Val Gly Asn Lys Val Asp Arg Tyr Asn Tyr Ser Tyr 
115 120 125 

Ser Asp Lys Ala Ser Ser Val Asp Ser 
130 135 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1323 
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<210> 55 

<211> 1222 

<212> DNA 

<213> pig 



<220> 

<221> cDNA 

<222> (1) . . (1222) 



<220> 

<221> CDS 

<222> (96) . . (827) 



<400> 55 

gtaatatcca ggtaagtatc tcagattagc acagagcttg gagaagagag agacagagag 60 

aggaagccaa gaagtgccaa aggaacagag acaaa atg atg tea tec aag cca 113 

Met Met Ser Ser Lys Pro 
1 5 

aca ace tac cct gga gtg tat gga ace aca cct gat ctt tac cca cca 161 
Thr Thr Tyr Pro Gly Val Tyr Gly Thr Thr Pro Asp Leu Tyr Pro Pro 

10 15 20 

agt aac ttt atg gtg cct ggg tct caa caa ccc cca ggg ttc ate aac 209 
Ser Asn Phe Met Val Pro Gly Ser Gin Gin Pro Pro Gly Phe lie Asn 
25 30 35 

cca cga ate caa gtt cag agt age cag gca cct ttc ate gtc tct cca 257 
Pro Arg lie Gin Val Gin Ser Ser Gin Ala Pro Phe lie Val Ser Pro 
40 45 50 

gga ate ccc aat aac agt cag cag gtt caa gga aat ata caa atg gta 305 
Gly lie Pro Asn Asn Ser Gin Gin Val Gin Gly Asn lie Gin Met Val 
55 60 65 70 

aat cca ggt aca gga aaa gca gee aca aac ttt aaa gaa gaa gca aag 353 
Asn Pro Gly Thr Gly Lys Ala Ala Thr Asn Phe Lys Glu Glu Ala Lys 

75 80 85 

aca eta ggg gca ate cag ate ctg att gga tea atg cac att ggc ttc 401 
Thr Leu Gly Ala He Gin He Leu He Gly Ser Met His He Gly Phe 

90 95 100 

gga att att ctg ggc tta atg ggc cgc act tat atg caa gtt ttg ggt 449 
Gly He He Leu Gly Leu Met Gly Arg Thr Tyr Met Gin Val Leu Gly 
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105 110 115 

ttt get tec ttg get ttt gtt age gga tat cca ttc tgg ggc ggc etc 497 

Phe Ala Ser Leu Ala Phe Val Ser Gly Tyr Pro Phe Trp Gly Gly Leu 
120 125 130 

tct ttc att att act gga att etc tgt att ttg gca tec aag aag tct 545 

Ser Phe lie lie Thr Gly lie Leu Cys lie Leu Ala Ser Lys Lys Ser 

135 140 145 150 



tec cct get ctg ata aaa age age ctg gga atg agt att gtt agt tec 593 
Ser Pro Ala Leu lie Lys Ser Ser Leu Gly Met Ser lie Val Ser Ser 

155 160 165 

ttc ttt gee ttc att ggg atg ate ctg ttg ctg gtg gat gaa age ate 641 
Phe Phe Ala Phe lie Gly Met lie Leu Leu Leu Val Asp Glu Ser lie 

170 175 180 

aac ggg ctg ccc gag caa gac tac tgg gcg gtg ctt tct gga aaa ggc 68 9 

Asn Gly Leu Pro Glu Gin Asp Tyr Trp Ala Val Leu Ser Gly Lys Gly 
185 190 195 

att tea gee atg ctg att ate ttc tct etc ttg gag ttc tgc ata act 737 
lie Ser Ala Met Leu lie lie Phe Ser Leu Leu Glu Phe Cys lie Thr 
200 205 210 

tgt gtc aca gee tat ttt gee age cac acc ate ace aac ace agg ggt 785 
Cys Val Thr Ala Tyr Phe Ala Ser His Thr lie Thr Asn Thr Arg Gly 
215 220 225 230 

ctg tec tgg tea ttc cac ctg tgt atg caa aca gtc cct tga 827 
Leu Ser Trp Ser Phe His Leu Cys Met Gin Thr Val Pro 

235 240 

cgacaggatt gtcctcagtt cctcccccag ttccggccac ttagctcaca ccacctaata 887 

ccagtaggta ttaaaagaag aagaagcatc ctgeaattte atgggggtag aaccgctctg 947 

cagaaacttc ttaagaagtt ggtcttcgcc gttcacgatg attgetaate tttaaagact 1007 

gtgtttcagg ttctgttctt ggtttgtcat gaatgaaaac tctatggacc tctctaccag 1067 

tgtttcttcc tacatgccac taccacgtgc tggcaaaggt gaagggctgg aggactggaa 1127 

atgactctgc aactctctta aagttagaaa tttcctgtta aagttaccct tccctcaata 1187 

aaatgttact aggagcaaaa aaaaaaaaaa aaaaa 1222 

<210> 56 

<211> 243 

<212> PRT 

<213> pig 



<400> 56 



-81- 



WO 02/062946 



PCT/US01/48437 



Met Met Ser Ser 
1 



Pro Asp Leu Tyr 

20 



Pro Pro Gly Phe 
35 



Pro Phe lie Val 
50 



Gly Asn lie Gin 
65 



Phe Lys Glu Glu 



Ser Met His lie 

100 



Tyr Met Gin Val 
115 



Pro Phe Trp Gly 
130 



Leu Ala Ser Lys 
145 



Met Ser lie Val 



Leu Val Asp Glu 

180 



Val Leu Ser Gly 
195 



Leu Glu Phe Cys 
210 



lie Thr Asn Thr 
225 



Lys Pro Thr Thr 
5 



Pro Pro Ser Asn 



lie Asn Pro Arg 

40 



Ser Pro Gly lie 
55 



Met Val Asn Pro 
70 



Ala Lys Thr Leu 
85 



Gly Phe Gly lie 



Leu Gly Phe Ala 

120 



Gly Leu Ser Phe 
135 



Lys Ser Ser Pro 
150 



Ser Ser Phe Phe 
165 



Ser lie Asn Gly 



Lys Gly lie Ser 

200 



lie Thr Cys Val 
215 



Arg Gly Leu Ser 
230 



Tyr Pro Gly Val 
10 



Phe Met Val Pro 
25 



lie Gin Val Gin 



Pro Asn Asn Ser 

60 



Gly Thr Gly Lys 
75 



Gly Ala lie Gin 
90 



lie Leu Gly Leu 
105 



Ser Leu Ala Phe 



lie lie Thr Gly 

140 



Ala Leu lie Lys 
155 



Ala Phe lie Gly 
170 



Leu Pro Glu Gin 
185 



Ala Met Leu lie 



Thr Ala Tyr Phe 

220 



Trp Ser Phe His 
235 



Tyr Gly Thr Thr 
15 



Gly Ser Gin Gin 
30 



Ser Ser Gin Ala 
45 



Gin Gin Val Gin 



Ala Ala Thr Asn 

80 



lie Leu lie Gly 
95 



Met Gly Arg Thr 
110 



Val Ser Gly Tyr 
125 



lie Leu Cys lie 



Ser Ser Leu Gly 

160 



Met lie Leu Leu 
175 



Asp Tyr Trp Ala 
190 



lie Phe Ser Leu 
205 



Ala Ser His Thr 



Leu Cys Met Gin 

240 
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<210> 57 

<211> 6 

<212> PRT 

<213> Homo sapiens, Mus Musculus 



<220> 

<221> conserved_motif 

<222> (1) . . (6) 

<223> X is any amino acid 

<400> 57 



Leu Gly Ala Xaa Gin lie 
1 5 



<210> 58 

<211> 4 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 



<400> 58 

Leu Ser Leu Gly 
1 



<210> 59 

<211> 6 

<212> PRT 

<213> Homo sapiens, Mus musculus 



<220> 

<221> misc_feature 

<223> X is any amino acid 
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<400> 59 

Ser Gly Xaa Leu Ser lie 
1 5 



<210> 60 

<211> 6 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 



<400> 60 

Gly Tyr Pro Phe Trp Gly 
1 5 



<210> 61 

<211> 8 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 



<400> 61 

Phe lie lie Ser Gly Ser Leu Ser 
1 5 



<210> 62 

<211> 15 

<212> PRT 

<213> Homo Sapiens, Mus Musculus 



<220> 

<2 21> conservedjnotif 

<222> (1) . . (15) 

<223> X is any amino acid 
<400> 62 

Ser Leu Xaa Xaa Asn Xaa Xaa Ser Xaa Xaa Xaa Ala Xaa Xaa Gly 
15 10 15 
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<210> 63 

<211> 21 

<212> DNA 

<213> Mus musculus 

<400> 63 

cacgaggcac acaagcaaag c 21 

<210> 64 

<211> 25 

<212> DNA 

<213> Mus musculus 



<210> 65 

<211> 24 

<212> DNA 

<213> Mus musculus 

<400> 65 

tgggtgagaa cacacaatca aaac 24 

<210> 66 

<211> 24 

<212> DNA 

<213> Mus musculus 



<400> 64 

aagtgcttga cttacatact tacag 



25 



<400> 66 

cacatacaca caagagaatt agac 



24 



<210> 



67 



<211> 



24 
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<400> 67 

ccattgtctg tactgtttct gctg 24 



<210> 68 

<211> 23 

<212> DNA 

<213> Mus musculus 



<400> 68 

gccaaatgca tacacatgtg cac 23 



<210> 69 

<211> 24 

<212> DNA 

<213> Mus musculus 



<400> 69 

ctggaagtga ctgggtgaca aggc 24 



<210> 70 

<211> 22 

<212> DNA 

<213> Mus musculus 



<400> 70 

caatcgttcc tcatatgcac ag 22 



<210> 71 

<211> 25 

<212> DNA 

<213> Mus musculus 
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<400> 71 

agactctggt ggtcattact gtctc 



<210> 72 

<211> 22 

<212> DNA 

<213> Mus musculus 



<400> 72 

gaatgccaaa tgcacagaaa gg 

<210> 73 

<211> 12952 

<212> DNA 

<213> homo sapiens 



<220> 

<221> genomic_DNA 

<222> (1) . . (12952) 

<223> n is an undetermined nucleotide (dATP , dCTP, dGTP, or dTTP) 
<220> 

<221> initial_coding_region 

<222> (4390) . . (4548) 



<220> 

<221> coding_region 
<222> (5017) . . (5136) 



<220> 

<221> coding_region 
<222> (6303) . . (6359) 
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<220> 

<221> coding_region 

<222> (7813 ) . . (8049) 

<220> 

<221> coding_region 

<222> (8851) . . (8952) 

<220> 

<221> coding_region 

<222> (10142) . . (10357) 



<400> 73 

tatctcatat tctcaaaatg atactttgaa atggggaagg taactattac cttaaactta 60 

cagagagacc acaagaggag caagcaagag agctgggact aactctgtct gctacatatt 12 0 

gccagaaatt gggagactga aggtgcaaag aagaaataat ttcttggaag ggctaacagc 18 0 

ttggttggac tgagatgggc tgaaaagcac agagggaagg gaagtaagag cactcttcct 24 0 

cggagatagg tgcacaggag gccagaaggg ataaaaatgc aaaactattt tgagtggaag 3 00 

agaatgaaga tgaggtaaca acagtgaggc agtttttaaa cttctcagta catgatggta 360 

cgagtggact taaaagtcta ttcaaaccaa aacccataaa ggctgtgaga tcttattcag 420 

tggatggaca tgaaaacagg ggtgccttta aaaataaaaa aatatttgca ggagttagtg 480 

tgagtaatta aatgcggtac tgagtgatca cctgagtctc tattttaaca ttcatgctga 540 

tttggacacc agcaaatgta aaatcttatt gttatcatat tgcggctaat gtggtccata 600 

acccctacta atctaccaac tagtctttga gcataactgt gcattgggca ctaagtcatt 660 

gtgaacaatt tattttaatg ataaggcaaa attgcgagaa tctccaatct ccgtaggcga 720 

gttttatgta tttatgaatt ctcatggaac gaaagctttt cattacagtc agcccacaat 780 

gccataaatc gattaatata tttacctctc atatatgaga cagatatata tataatcaga 840 

tatatatatg aaagatagtt tccataagaa ataaaaaaac agagaaagca ctcatgtgtt 900 

tcttttataa actactattt tatacaaata tttgtttaag gtttgtagaa gtatagttgc 960 

caccatgaga taccatttta acagagaaat atcacacaca cacacacaca gccctcatgg 1020 
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tcattattgc cattagttac aacaggcata 
tatctcgcca taatacagac acagctttaa 
5 aacaatggca gtccaatcat ctccttcagt 

ttatcttcac taaagttacc cctctggggc 
cttgccatag gaccttggga aaaataacct 

10 

agtagtcaaa atgcctactt acgaaggtgt 
ctctagtata tagtaggtgc tcagtaaaca 
15 agcagaaaaa gcatcaacat cagagccatg 

aactgtgagg tctttactaa ggcaaattct 
aatgaacaga atggtttctg aggtttggac 

20 

atcagtccgc aaactcttta gtgggctata 
tagagtggct tgttgtacaa agcagatccc 
25 caggggcaaa atcaagtgag aaacaaccaa 

gccagacaga taatactttt caactgattt 
ggagaaaaga ggggtcataa gggaaaggct 

30 

gcctggtacc ctggatgcct aatttgagag 
ctgggcatgg tggctcacag ctgttatcct 
35 gcttgagtcc aggagttcaa gaccagcctg 

aaatataaaa attagctgag tatggtggtg 
tgaggcagaa ggatcacctg agcccaggag 

40 

actgcactgc agcctgggcg acagagcgag 
aaaatattta ctaattatgt aacttggcac 
45 cctcttctat aaaacacagg tgatgattca 

gatgccatgt gtcaaattta cttccagccc 
ctgcctgttt tacagcacac ctgcagctac 

50 

ttcaggctgt ggtctaatgg ctgggatgtg 
atagacagtc ttgatttcca agacatctta 
55 aagatgtgtc cccaaaagag ggattttcag 

aagtggttaa catgaacaag tgaatgctaa 
ttaaaatgtc tgagctggaa agggccgtat 

60 

atcatgaaaa ctgataaata gcatttacta 
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gcccaagtag 


aattggctta 


caaaaaagtt 


1080 


tcaagcagtt 


gatgaataaa 


atgaaattaa 


1140 


tcaaacaatt 


gatgaccttt 


gcattttaag 


1200 


ggagtgtagt 


gactcccagc 


tcattccata 


1260 


ctctcaacct 


ccatttcttc 


atctgcagaa 


1320 


tgagtgagtt 


ggcaatacca 


tatacaatat 


1380 


actataatta 


tctggactca 


cctatggtac 


1440 


gtgagtgcct 


tctagcttgg 


ctctgttgct 


1500 


atacccagac 


tgcagcttcc 


tcatttgtga 


1560 


tagctctaga 


aatcctacct 


agtgaccaga 


1620 


ttaaaatatc 


tgtttttgtt 


agaggtcttg 


1680 


caggaccatg 


tggcatctgg 


ctatctaggg 


1740 


agtttcttct 


ctctttcctc 


tagcccatgc 


1800 


atagtcattc 


acatagtgca 


tatattggaa 


1860 


gaattcctat 


ttctcacagg 


agacaaagca 


1920 


atccaagtct 


tcattttaaa 


tctttacagg 


1980 


agcactttgg 


gaggtggagg 


ggggcagatt 


2040 


ggcaacatgg 


caagacccca 


tctctaaaaa 


2100 


gttgcccgtg 


gtcccagcta 


ctccagtggc 


2160 


gtagaggctg 


cagtgagctg 


tgatcatgac 


2220 


gccctgtcat 


gaaaacaaac 


aaacaaacaa 


2280 


aagtaaccta 


gcttctccgt 


gcccttgttt 


2340 


ccccttgcct 


gttctataag 


gctcagatga 


2400 


taaagtaggt 


ataaatgtga 


cagtagtcag 


2460 


tcagctctga 


ttgacttcag 


taagatatac 


2520 


gcaccggctt 


cagtaagata 


tgtgacctgg 


2580 


catgacttac 


aagcaataaa 


gacaagagca 


2640 


a uuccaggaa 


agacccagug 


gay cccgcat 




ggaaacccaa 


gatgaagatt 


gaacagaatc 


2760 


taattgcctt 


ctcacttcag 


aaatgaataa 


2820 


cactctccct 


ttgtcctagg 


cacaattaca 


2880 
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tatgttaact tatttatgtc ttacatctca 
gataaaatcg aaactaatgc tcagcaaagt 

5 

tatttattcc aacatgggtt ctttgcatac 
gtaactaaat ccaaaaatta ttttagatcc 
10 atggaacttc cattctgtgg ggaagagact 

ttcctacagt gatcaatgcc gtaaagcaat 
aggagtacat gttatagatt gagggttcaa 

15 

ctacacctga acaaaaagac actagccatg 
gcagcccacg agataagaat tattattatt 
20 gaatttaaag aacttgccca acatctcaga 

gcaatctgac tcaggaggcc acagtcttat 
ttctttttct ttgagaccga gtctcactct 

25 

ctcagctcac tgcaacctct gcctcctggg 
agtagctggg attacaggtg cacaccaaca 
30 gatggggttt tgccacgtta gccagcccgg 

catcttggcc tcccaaagtg cttggattac 
gccttcttga tgtgacttgc acaaggtggc 

35 

gacccctcct gcttctacag atagtcctta 
cataatgcac aaaggcagat aagcaaaagg 
40 tcaatattaa agcaaaagaa aagcatccag 

atatgcaagg tgtcctctac aaagataaag 
ggcagggact cctgggccca catgctcttc 

45 

actcagcagt aggccttgcc tcagatccaa 
aatgacactc atggaggaaa tgctgagaga 
50 tgccaaaaat cttgttcttg ctctcctcat 

gagagcaaaa tgacaacacc cagaaattca 
aaaggcccta ttgctatgca atctggtcca 

55 

gtgggcccca cgcaaagctt cttcatgagg 
ccttccatcc catgtcgtag ggattctctg 
60 atacagaatt caatccaatt tgaagaattg 
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gagaggggta 


ctgttctaac 


tttacagaag 


2940 


acaaagaaca 


agaatagcaa 


caaaaataac 


3000 


atttatttct 


tcaataatat 


ttattaagaa 


3060 


tgaacaagag 


agaacaaaat 


ctctactttg 


3120 


gacaataagc 


aattaaataa 


ataaggtaat 


3180 


taagatagga 


ttttgtaaaa 


gacagcaaat 


3240 


ggtaggctcc 


tctaggagct 


gacatttgag 


3300 


cacagaccat 


gagcccagtt 


aagtgttata 


3360 


tcaattttac 


agttgaacct 


gaggcccaga 


3420 


acaaatggag 


gaatcactat 


tgaaacctag 


3480 


atactgcatt 


agaaagcctt 


agagagcctt 


3540 


gttacccagg 


ctggaatgca 


gtggcatgat 


3600 


ttcaagcaat 


tctcctgcct 


tagcctcccg 


3660 


tgcctggcta 


atttttgtat 


ttgtagtaga 


3720 


tctcaaactc 


ctgacctcag 


gtgatctgcc 


3780 


aggcatgagc 


caccgtgccc 


gacctagaga 


3840 


agagttagag 


acagagagag 


gcctggaatc 


3900 


ccatactctg 


caatgttgcc 


tctggcccat 


3960 


acaaggacaa 


gtccattgaa 


aatacatttt 


4020 


gaataagaaa 


caaagaggac 


atgcagtcat 


4080 


aatgccccaa 


acccagttgt 


caagatcact 


4140 


ctaaacaacc 


cctccatctc 


ctttctcaga 


4200 


ggtcactcgg 


aagaggccat 


gtctaccctc 


4260 


agcattcaga 


tgcatgacac 


aaggtaagac 


4320 


tttgttattt 


gttttatttt 


taggagtttt 


4380 


gtaaatggga 


ctttcccggc 


agagccaatg 


4440 


aaaccactct 


tcaggaggat 


gtcttcactg 


4500 


gaatctaaga 


ctttgggggt 


aagtcagttg 


4560 


gctgacagaa 


gctgatgcgg 


tataggccac 


4620 


ggatccaacc 


tgatgtcttc 


tttatgtcta 


4680 
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acacagtggg ccaaatcagg ggtgcatcag 
atcttatgtc accttttggt tttggggctt 
5 ttgccagaat cccaggcata cctgctccct 

ttcgaacaag aaaaagacaa aattcttggc 
cagctccaag tcaggagcca gagcttccaa 

10 

tcagctcatc cactgctgcc tctgttctct 
ttccacattg ccctgggggg tcttctgatg 
15 gtgactgtgt ggtaccctct ctggggaggc 

tgggaaatgg tgcagacaaa aatgttaaaa 
gtgttgaggg aaatatatga gtaggaaata 

20 

ttgaccaaat tgagtataaa tcccatggtt 
cagagaaagg ggccctgcat ggatctcaca 
25 tcagttctgg tggggagggt ggccctagac 

agcaagaggc agcaggaaca tccacttcca 
gttactatgg tacctggtga aacctgtccc 

30 

cagagcagca ctcttttggc tgttttacgt 
ggtgtggcac ataggacagt gatgtttatt 
35 taaaagataa gtccaggatt aaaattttcc 

ttcctccatt ctacagcctg aatgtgagaa 
cctgtaacag ccaatgtttt catggagtgc 

40 

gcatcagttc actcttcaga gggcatcaga 
cagttccaca agttaaaaac aaagaaaaac 
45 agtattaatt gatgaggaaa tttgagttct 

cagatcccca gatgattgag taaaaggcat 
aggaatatga cagctgaaaa atgaatacaa 

50 

agttgctgaa agtgtaggct tctttctgac 
ctctcctcta tctcctgtct tgcccacccc 
55 agtatattat ttccggatca ctcttggcag 

caagtaacca tatgtccttc tttcccacat 
aaactgagac ccttaaaaag ccaaggtatc 

60 

ctccacaatg ttattgtgat tttatttatg 
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agaagttatc 


acttagatca 


cctctgggtg 


4740 


gtatatgcag 


gggtccccca 


tcccagtcca 


4800 


ggaaatgccc 


catgtggttg 


aggaaacaga 


4860 


acctccactg 


cttcctttag 


gcattcctca 


4920 


ccttgtcttt 


gcctgctagc 


agtgatgatt 


4980 


ccccaggctg 


tccagattat 


gaatgggctc 


5040 


atcccagcag 


ggatctatgc 


acccatctgt 


5100 


attatggtga 


gtaaaagaat 


agcagccatt 


5160 


ggctccacag 


ggatatgcca 


gattatttct 


5220 


ttattgggtt 


aaagtaatta 


agaagacagg 


5280 


gagagtcagt 


ggtcctgttt 


catgtgaatt 


5340 


gggactgtcc 


aaagcaagaa 


ctctccaaag 


5400 


atttagacta 


gatagcaaga 


tgttttggaa 


5460 


tctacccctt 


cttgcttaca 


attctgtttg 


5520 


atcacaagtc 


agtctcattt 


tgcttatcga 


5580 


acatgttttc 


caaatctgta 


accctgtctg 


5640 


tccccgtgat 


acttttcata 


gttgccacta 


5700 


aaacaagtag 


agatgggtta 


gaacaagagt 


5760 


aaggaagatg 


agcaatagtc 


atcatcactt 


5820 


ctgtgccatt 


caggtcaagt 


atttccttct 


5880 


gtcatttatg 


tcactgtgaa 


ccccaaaggg 


5940 


tagaaataaa 


acttttaaat 


ttatggtatg 


6000 


gtctctttgg 


tcttactata 


ttcctagtca 


6060 


gaatttagtg 


tcactgagcc 


tgaataaagg 


6120 


ctgataaaaa 


tgggtggatg 


gttgtgtgaa 


6180 


cagttatcaa 


tgttaaaaag 


tgatctccct 


6240 


ctctccatct 


cccccacctc 


tcttttttac 


6300 


M> *w ^ CA. VH CL 


U. CA. CL W V^* C*. 




63 60 
\j ~t \j 


gtcagagaag 


tacctatttt 


tttcggttaa 


6420 


acagcctctc 


agccctaaaa 


agcaaagacc 


6480 


aaaaacttag 


aagcgaggtc 


tatctgaagt 


6540 
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atgttcatgg gaacagaact aaaagcagat ccatgaaaac catacctaca gtcttaagaa 6600 

cgttaaatgc tgtgtgaaaa taatagacct ttctgaaagc cctatcattt ctcccagatc 6660 

accatttagg aaaattatct gatcaatgtc atgattgatt caaattctag ctaagccatt 6720 

ttttggccgt aacattgaac aagtcagttt acctctatgt tcctgagttt tcacctagaa 6780 

aggaagggta acagtccttg ctaccatgtg acgtccaatg gagatgaaag gcagtagagt 684 0 

gtgtgatggt gcttcacagg ctataaagta ctacactgtg gtcttgccca taaaacccct 6900 

agggactcat ctagatccag aggaaactgg cctgcagagc tgctgatgct gtatggatga 6960 

aaagagttta gcagcaagtt cgctcccaaa aaattcctcc cccaacactg ttactaaact 7020 

gtgtcacttg cataatcaat gagggaatgg gtggattgan nnnnnnnnnn nnnnnnnnnn 7 08 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnttatattt tcctgtttca 714 0 

accttttatc atccctgcaa gggtagaaca ttcacactga tattttctta cctatgctac 7200 

ccaaagacat cagccctaat tgtattttgg aagatagctg actggggctg attgcaacct 7260 

atgtcagcag gaatagatgt tgttactgtt gttgcttctg ctttttttat ttccatttat 7320 

ttgatagtac agatctagag ggttctatct gaaccttccc aacctatact tcataatacc 7380 

atcccactaa agtgtgatac aagaaacttc ttcactctct tccctctacc tatttatgaa 7440 

ggcagataat aaactggata atatttatct tcacttattc aacaaacatt tattgagtgc 7500 

ctactaggat ggtggcagtg gcagtgaagg aaatgcaagg atacaagata tagaatcaag 7560 

ggttactctt agaatttttg ctttataaaa cagatggatg gtgaatgaga tagggaagac 762 0 

tgagaaaaga acaggataga gacatgattt tattttatag tgacaaagag gctaaaaaca 7680 

actgagagaa cttcagtata tttagttgta gttgctttgt gagtcagggc agttgcattt 774 0 

ggaattccct cccagattat gttttccaaa gggaaatcaa acccaattaa taaatctgtg 7800 

tctccatttc aggtcaaagg aaaaatgata atgaattcat tgagcctctt tgctgccatt 7860 

tctggaatga ttctttcaat catggacata cttaatatta aaatttccca ttttttaaaa 7920 

atggagagtc tgaattttat tagagctcac acaccatata ttaacatata caactgtgaa 7980 

ccagctaatc cctctgagaa aaactcccca tctacccaat actgttacag catacaatct 8040 

ctgttcttgg taagtgttct tggtaagtgt gagattggat ttctctccag ggaggaagga 8100 

tgacttgttt attatgagca tgaactctgg atccagacca cctgagtttg ctagttactg 8160 

tctgtgtggc tttgggaaag aattttaacc acactgtgcc tcaatttctt caactgtaaa 8220 

atggggatgg taacactatt tatctaatag ggttgttctg aggaataaat aagttaatat 82 80 

atacaaagca tttagaacag tgtttggtac acagaagtgc tatataagca ttggctttaa 8340 
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cattaattat tctcaacatc attaatggca 
agcatagaag aaattcctaa gatgttatga 
5 gagtttgaaa gagataaaaa aaaatgcatc 

atgccacacc aactaaggct gaatacaatt 
atacagaatt gattcttcaa ctgattattc 

10 

aaccatctgt tgtgtgccaa aagttgtgtt 
tgtactagca gttctcacag ctattcatta 
15 caattatagt caacaataac ttactgaaca 

tttcaccttc attcttctgt tgtttttcag 
ttcttccagg aacttgtaat agctggcatc 

20 

agacccaaat ctgtaagtag tagcccctct 
ccacaaagga tcatttatgg agaatgtaat 
25 ttaaaaaaaa aaattggtct tggcatattt 

tttgaaataa gtctgtagtt agagcctaca 
tttttttgag acagagtctc actttgttgc 

30 

cccagtgcaa cctctgcctc cgggttcagg 
tgggattaca ggcacccgcc atcatgcctg 
35 gttcaccatg ttggccaggc cagtcttgaa 

gcctccgaaa gtgctgggat tgcaggcgtg 
ttattaacct tccctgctgc ttgacttcag 

40 

cagtgtgtaa aagcaagatg gttgaaggtt 
tttttcagta ttcccctctt ccctcctttt 
45 aaagggctcc acaaatggga aataaaaaca 

cagaaccaga ctgttagatt tagaatttag 
taaagaaaaa tagtatcaaa gagagaaaat 

50 

aaatcagaac gaaaatctaa actcctctat 
ggtcattcca ggcactgtgg tcaatgtctg 
55 tttaataaat gacttgataa ggatataagc 

atataataaa tgacataagt agcataaaaa 
attgttgaca aaggtgtcag tggtaaaagt 

60 

gaacatagtt ctcctgtcag cagaagaaaa 
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ttaatattta 


ccattatatt 


aactaagatg 


8400 


atatcttttg 


cccaaggact 


tttaaatgta 


8460 


tcccaaggat 


gaagaggtct 


cctaggcatg 


8520 


taatttgaga 


ctcattactt 


attctttcct 


8580 


ataccttact 


ttctatcagc 


aatacacatt 


8640 


aagagttagg 


gttataaaga 


tgctgtctcc 


8700 


cttgtctaaa 


gaattgatct 


ettaategtt 


8760 


ccaacaatgt 


tctttgtgcc 


attattacat 


8820 


ggcattttgt 


cagtgatget 


gatctttgee 


8880 


gttgagaatg 


aatggaaaag 


aacgtgctcc 


8940 


ggccaaaacc 


tccctctaga 


aaatccacat 


9000 


ctgatgagtt 


gttgaaacta 


ggagcttgat 


9060 


ctagaaagca 


aataatatga 


tgtcttgaaa 


9120 


gttctatatt 


ctgtgcgtct 


tttttttttt 


9180 


ccaggctaga 


gtgcaatggc 


gcgatcttgg 


9240 


cgattctcct 


gcctcaccct 


cccgagtagc 


9300 


gccaattttt 


gtatttttgt 


agagaegggg 


9360 


ctcccgacct 


caggtgatcc 


tcccacctca 


9420 


agccaccatg 


ccccacctct 


gtgeatctte 


9480 


gcttccatcc 


tgatttcatg 


attccatata 


9540 


gaaggctaag 


tcactagttc 


tttggtttca 


9600 


cacctaattc 


ctcttccaaa 


acaacaggtt 


9660 


gaccaataat 


agatatatta 


etaactgett 


9720 


aaatcatcgc 


taattacttc 


attttgeatt 


9780 


tattctccca 


gagttatatt 


cactaaaggc 


9840 


tactgcaatg 


ttcttttccc 


aataccacgt 


9900 


ctgcccttga 


agatttattc 


agacttgagt 


9960 




Pi Pi Pi 1~ f~ t~ Y~ cin c 
acta, i— i— t_ i_ y y 


cl l, ex ct d y y \^ 


X U VJ u 


ccaggaggta 


tttgataaat 


gtttgtggag 


10080 


aaagaatggt 


ttgtttaatt 


ttctgtttta 


10140 


aaaagaacag 


actattgaaa 


taaaagaaga 


10200 
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agtggttggg 
tccaatccaa 
tcaggaatcc 
tttccttttt 
gaggtactct 
tccttctctc 
cttcttcttt 
gatttcctac 
ttctccatca 
cattcctaaa 
tcattgtttt 
tttatctttc 
gaagctctaa 
acatgtctct 
tacttttagc 
gcccaaattt 
atttcatcaa 
tacaagtaat 
cataactagt 
atgtttgaaa 
cagcttttat 
agccagattc 
aactacttgc 
tatagacact 
caacagctgc 
cctagggtga 
gctgccagat 
atttttagcg 
tttctgtcta 
ttttctgcaa 



ctaactgaaa 
gaagaggaag 
tcaccaatag 
taaacattag 
gcacatacgc 
ttacattgaa 
tgagcaactt 
taacctgttc 
acaaccaggg 
ctatcttttt 
aaggatgata 
tacagggctg 
atagccaaca 
gccttaactt 
cattatgcga 
caaataataa 
ataacttcat 
tcaaagaggc 
aattatgaga 
taaccaaatt 
caagaattta 
agattttacc 
atcataaata 
ctcagaatat 
cctacttttc 
agtcactata 
tgtggcaggt 
accactagtg 
tgtaaagttc 
cagatgattc 



catcttccca 
aagaagaaac 
aaaatgacag 
tgttcatagc 
accacatctc 
tgtagagaat 
tcttacactg 
cttggatagg 
agactgcacc 
tttattccac 
aaaaaaaata 
acattgtggc 
cccatctgtt 
ttcctaactc 
gaaaagaaaa 
ttaacattta 
ttgatgttcc 
caaataactt 
gccttgtttc 
gtctctccaa 
ctctctgacc 
catggggata 
tataataact 
catacttgga 
aattcagata 
atctgtagtc 
aaagagacaa 
gaagacattc 
tcaaaatttg 
caacatgggt 



accaaagaat 
agagacgaac 
ctctccttaa 
ttccaagaga 
tatctggcct 
gtagccattg 
aagaaaggca 
ctttttagta 
tgatggaaaa 
atctacgttt 
acaactaggg 
acattcttag 
ttttgtaaaa 
ccactctagg 
aaatgaccat 
gttatattta 
atgatcaaga 
gtaaacaaga 
ataaccaggt 
tgtctgcata 
agcaataaac 
aaaagactca 
ggtagtttat 
aacaatgtaa 
tactagtacc 
tattatttgg 
tgtaatttgc 
cccaaaatta 
ttctaaatta 
gtttgtctat 



gaagaagaca 
tttccagaac 
gtgatttctt 
catgctgact 
ttgcatggag 
tagcagcttg 
gaatgagtgc 
tagtattttt 
gatatatgac 
ttggtggagt 
acaatacaga 
agttaccaca 
acagcatagc 
ctattgtttg 
agaaaatgcc 
taatttccag 
aagaatccct 
aaaggtaact 
cttcttactc 
aactgtgaga 
aagcactgag 
gactttcacc 
atgaagcaga 
ttaaaatgcc 
ttacctagaa 
gcatttgcta 
actccctatg 
gaaaaaaagg 
ataaaactat 
tcttctttac 



ttgaaattat 
ctccccaaga 
ctgttttctg 
ttcatttctt 
tgaccatagc 
tgttgtcacg 
ttcagaatgt 
ttttgtcatt 
tgcttcatga 
cccttttgca 
acccattcca 
ccccatgagg 
ttatacatgg 
catgtctacc 
accatgaggt 
atgacaaagt 
atctctattt 
tgtcaacagt 
aaatcctgtg 
gccaagtcaa 
agacacagag 
acatttggaa 
cactaagtgc 
gaatctgagt 
ataatgttaa 
catgatgagt 
atatttctac 
agatagaaga 
ctttgtgttc 
tcttgaaaca 



10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 



-94- 



WO 02/062946 PCT/US01/48437 

ttagaccatg ggaggctctt acagccttga gttgatattt atacaaccca aatctaggtt 12060 

tgaacggtga ggtgtcaggt catcaaatat tcatgtctat atagtcttac acaggttctc 12120 

aaaaaaaatg ttcatgggat aggtcattga taatggattc cttattctga gaactccaga 12180 

cgactgaaat atatgagaga aggaaaagga catagtagga gcaggcctga gaaaaaaatg 1224 0 

aaagtcagaa atctttaaaa aaatacaaga tcttatttct atcttatttt ttctcctctt 12300 

ctgaaatata tatgaggatt cctctccaaa cccatggttt ctctaagaat tttgagtcat 12360 

ttgtatgacc tcaaataatt agttttagct gacctcacat aactccttat aataggagac 12420 

atctttaatg tctgctatta aagaaggatg aaaattccta tgaccttctc cccgattatc 12480 

cctttggcta tatagagtca aataataaca ttgaccaata gtaaacatgc tttgccaaga 12540 

agtagaagat atattctcta gccttagttt ttcctcccaa tttgcatttt tgtaaaaata 12600 

atgttgtatc cacaaaggaa ataaacttta aaaacccaag tgcactcatt gactaggagg 12660 

ctgacgtaac ttattttcat tttcttctgg ttttcttata atctttgttt aaaataaggt 12720 

gataaaccat cccagtttgt ctgagtctga gggattactt gtttggaaaa ttttaatcct 12780 

ggacttatct tttatagtgg caactatgaa aagtatcacg gggaaataaa catcactgtc 12 840 

ctatgtgcca cacccagaca gggttacaga tttggaaaac atgtacgtaa aacagccaaa 12 900 

gagtgctgct ctgtcgataa gcaaatgaga ctgacttgtg atgggacagt tc 12 952 

<210> 74 

<211> 10100 

<212> DNA 

<213> homo sapiens 



<220> 

<221> genomic_DNA 

<222> (1) . . (10100) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> initial_coding__region 

<222> (1918) . . (1973) 



-95- 



WO 02/062946 PCT/US01/48437 

<220> 

<221> coding_region 
<222> (2844) . . (2973) 



<220> 

<221> coding_region 
<222> (3488) . . (3622) 



<220> 

<221> coding_region 
<222> (5944) . . (6000) 



<220> 

<221> coding_region 
<222> (6552) . . (6710) 



<220> 

<221> coding_region 
<222> (7116) . . (7214) 



<220> 

<221> coding_region 
<222> (8710) . . (8805) 



<400> 74 

ggaaaccccc cccaaaaaaa aaaaaaaaaa aaaaaagaaa agaaaagaaa aagaaagata 60 

aaatgaaaat tcgtggctgg gcgcagtggc tcactcctgt aatctcagca ctttgggagg 12 0 

ccgaggtgag tgagtcacct gaggacagga gttcaagacc agcctggcca acatggtgaa 180 

accccttctc tactaaaaat tcaaaaaaat tagcccagtg ttgctgcaca ctcctgtagt 240 

cccagctact cgggaggctg aggcaggaga attgcttgaa tctgggaggc ggaggttgca 3 00 

gtgagccatt gccctccagc ctgggtgaca gagtgagact ctgtctcaaa aaaaaaaaga 360 
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aaaaaaaaga 


aaatagaaaa 


ttatcgaggc 


tatcatttta 


attttaagtc 


tgctctcaaa 


420 


tgttaccttg 


tctgggaatt 


ccttccctga 


actctacata 


tcccctatca 


ggcagaattg 


480 


ttttaccctt 


ctatgtgttc 


caacatcctc 


tggcctccaa 


catattgggt 


aataatatat 


540 


catttatctt 


tgttctattc 


tgcttgattg 


tggactagaa 


ggacctggga 


ttttggcatt 


600 


catctgtgca 


tgcacagtga 


taagggcagt 


gcctggcatg 


gaacactaag 


ttctttttgg 


660 


atgaacgtat 


aagtgagaaa 


gttctattcc 


ttaattccaa 


tagcaccttg 


tttaggtcca 


720 


ccaattcctg 


aagacaccat 


gggaagatga 


cacctgagtt 


aataaactta 


gatacaaaag 


780 


ctataccaat 


atgtctttta 


gtatactctg 


tttatgcaaa 


tatcttagtt 


tcctgcaaaa 


840 


agattttctt 


tttcaccccc 


cacccccacc 


aaaaaaacca 


cacatgatta 


gaggtgtgat 


900 


ttttagtatc 


ttaaatatta 


ttgaaagcca 


ttcaaggtta 


tgtttataat 


taaactgaaa 


960 


taaaagagat 


taaaaccttc 


tctgaatttt 


atctgtactg 


attttgctat 


ggtctagcta 


1020 


gttatggtat 


ttcaatgatc 


acatggttta 


gccctaattc 


taatttaggc 


atactactca 


1080 


atccaatgct 


gtttagattt 


gctcatggga 


tttgttataa 


agttcatgtc 


tctcagtgtt 


1140 


ctcactatcc 


cattcttgcc 


actgtaaaga 


tctaagcacc 


atcagtctaa 


taatttagca 


1200 


gacaatggag 


tcttcttagc 


ttttacacac 


tatcaggtat 


ggttcacagc 


agttagaatg 


1260 


cagcctgttg 


tatttccaag 


accatctgct 


gcagcgaatc 


atgtaaacag 


tttccaaaag 


1320 


acggaaagag 


agagaaagat 


gatagatatt 


aaagtcagag 


atggatctgg 


agatttagag 


1380 


actaacacat 


actcactcac 


atacaacagc 


acttcagttt 


tgggtctatg 


tgtgaaacca 


1440 


agcatgcctt 


aaacaaggaa 


gacaagcttt 


tcaaaggtgc 


aattggataa 


cttctgccat 


1500 


agaaatggct 


gaattgggac 


acaagtgggg 


acaattccag 


aagaagggca 


catctctttc 


1560 


ttttctgcag 


ttctttctca 


ccttctcaac 


tcctactaaa 


atgtctcatt 


ttcaggttct 


1620 


gtaaatcctg 


ctagtctcag 


gcaaaattat 


gctccaggag 


tctcaaattt 


tcttatttca 


1680 


tattagtctt 


tatttagtag 


acttcttaat 


ttttctattc 


atcacaagta 


aaagcctgtt 


1740 


gatcttaatc 


agccaagaaa 


cttatctgtc 


tggcaaatga 


cttatgtata 


aagagaatca 


1800 


tcaatgtcat 


gaggtaaccc 


atttcaactg 


cctattcaga 


gcatgcagta 


agaggaaatc 


1860 


caccaagtct 


caatataata 


atattcttta 


ttcctggaca 


gctcggttaa 


tgaaaaaatg 


1920 


gacacagaaa 


gtaataggag 


agcaaatctt 


gctctcccac 


aggagccttc 


caggtaggta 


1980 




tttttttcta 

L~ I*. ^ W- G*. 


w w >— • v» d ^5 V* c*. 


cttaataaca 


oaaaaaatca 


taatcacaot 


2 04 0 


gcttaggaga 


tgaaacttta 


ttgatttagg 


catggatcca 


tctagtttaa 


ttaatatatt 


2100 


gggtatgagg 


aagctacttg 


ctgtactttc 


catgtggttc 


tctctccctg 


gagaggaaca 


2160 


tttttactca 


gcttgcaaac 


tggaaataga 


ttttctcaca 


ttagaagctc 


attttctggg 


2220 
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tatgagacag gagagttcat actgtgtatg 

ctgagggaca catatccttc acacatgctt 

5 

ttgattggtc tttataatcc ataagctgtg 

agagtcccat ctagctattc aaagccattc 

10 ggtttgttac tgacaaaata tattcagata 

ataggtaaat gtgtttattt tcagttttac 

gaaagtaaca ctggattaag aataggaaaa 

15 

atctaatcac ttggatagtg ttcaggtgtt 

tagtttcttg gtgattacaa tgaagatttg 

20 aaactcccct ttctgtctgt cgagaatgtt 

tgaattgctt ttaaatttca cagtgtgcct 

gaagtatctt caggcagact attgaagtcg 

25 

acagttttga aaaaagagca ggagttcctg 

a 9 a 9t aa 999 ttgggtctag aaaagaatat 

30 ttcatgagag gtgttaggtc ctttaaaaaa 

gtccttttgg gaagagagaa gcatttcctc 

ggccaacagg agtgaaggat tcataatctt 

35 

tgatgttcaa catgactacg tacataaagg 

attaggtagt gcttgtatca attcttatag 

40 tggatcattt ctcaggacag tctaggacac 

attaaaaaat gatacaatct cgggaaaatt 

tctataggta acacaaattc tgactgctat 

45 

ctctgtactt gatatttcac acattgaggg 

tccattctgg ggagccatat ttgtgagtat 

50 aacataggtt tttctctttc tcagatctaa 

atttataatt cttaattata aattatatgt 

caacaacaaa agattctttt tacaattaac 

55 

tcccatgaga aacctgaatc taatacaagt 

atagtaattg aacctaaact ttcagaatcc 

60 atcaataaag ttatatgata aatacataca 
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tagatctctg 


gcttctgggt 


ctgacatgtg 


2280 


ttataaatac 


ttgataaagt 


aacctgcttc 


2340 


ggatgcttct 


ctgaagatga 


aaatagtaat 


2400 


cttcattgta 


ttctgtgcac 


atgaagttgg 


2460 


catttctatg 


ttaaaaggat 


tgtgagatgc 


2520 


ttgtcaacat 


agatgaatga 


gaaagaactt 


2580 


tttggcatgg 


attttgctcc 


attttgtccc 


2640 


cttggtcagt 


tacttggatg 


ctctgagctt 


2700 


aattacagga 


tggctttgaa 


aaaataaaca 


2760 


gcacagggag 


ttacagaatg 


ttctcatgac 


2820 


gcatttgaag 


tcttggaaat 


atctccccag 


2880 


gcctcatccc 


caccactgca 


tacatggctg 


2940 


ggggtgagtg 


agcctcctcc 


aactttgact 


3000 


tgagttgcat 


caactgtttt 


cccacttgga 


3060 


catggtagat 


aaagagttga 


cactaactgg 


3120 


ataaagactt 


taaattgcta 


ggacgagaat 


3180 


tatctttact 


tagatgtaaa 


gaacaattac 


3240 


cgcatggaga 


aaagtattgg 


ccttccatgc 


3300 


tggctagggt 


atcctggaaa 


atcttacgtg 


3360 


taacgcagtt 


tctcatgttt 


ggcttctatt 


3420 


tttttgattt 


tcatgaaatt 


catgtgtttt 


3480 


gatatgcctt 


tgttttggaa 


cagttgtctg 


3540 


agacattttt 


tcatcattta 


aagcaggtta 


3600 


atatctataa 


ttgtttctga 


aataacactg 


3660 


ccagttgttt 


attcccagta 


ttaacatgat 


3720 


gagcatatat 


aacatagata 


tgctcattaa 


3780 


ggtgggttaa 


acatttagcc 


cacagtttta 


3840 


taaatgactt 


gcctaagggc 


cacttgacta 


3900 


aactccagga 


acatacttct 


agcactattc 


3960 


actttatctg 


tcaactaaaa 


ataacaacaa 


4020 
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aggctgggca 


tggtggctca 


cacccgtaat 


cccagcactt 


tgggaggctg 


aggcaggtgg 


4080 


atcacctgag 


gtcaggagtt 


tgagaccagc 


ctgaccaaca 


tggtgaaacc 


tcatctctac 


4140 


taaatataaa 


aaattagctg 


agtgtgatag 


tgcatgcctg 


taatcccagc 


tacttaagag 


4200 


gctgaggcag 


gaggcttgtt 


tgaacctgga 


aggcagaggt 


tgcagtgagc 


tgagattgtg 


4260 


ccattgcact 


ccagcctggg 


caataagagc 


gaaactctgt 


ctcaaaataa 


taataataat 


4320 


aatagaaaat 


aaagttgtct 


tcatgaaaaa 


tgaggaaaga 


gattgctggg 


gtgagaaaca 


4380 


ttaagatcaa 


agggcatatg 


gtgaccttct 


atgccctaga 


aactctttta 


ggtattttct 


4440 


cctggtatct 


cttttactca 


tcgttctatc 


tggaaaaata 


ggtggatgag 


tgagataata 


4500 


aggtatataa 


ctttttaaag 


gtctaattga 


catataataa 


attgcaagta 


tttcagatgt 


4560 


acaatttgct 


aaccttgaca 


cacatagaca 


cacatgaaaa 


catcaccaca 


ttaatacaat 


4620 


gtatgtatcc 


atcatttcca 


aaagcttccc 


tgtgtatctt 


tgtaactctt 


tcttcctccc 


4680 


tccactcctt 


gtcctctcgt 


tcccaagaaa 


acattgatct 


gctttctgtg 


aatataaatt 


4740 


aacttacatt 


ttttagagct 


ttatataagt 


atgttctctt 


tactgtttgt 


cttccttcgc 


4800 


tgcacagtta 


ttttgagatt 


cttcaagttt 


tttctttata 


tcgatacttc 


attcacaaga 


4860 


atatatttta 


attctagact 


atgtcacatt 


gactttgtag 


tctgctaaat 


ccttagtgct 


4920 


cagatgactt 


gttcaggact 


ctccttgaac 


ctgtacctct 


gttataattg 


aaacttgtct 


4980 


ctactgtctt 


tttatttcaa 


acacagctta 


tgaggtgtct 


ctcaacccat 


caaactcaca 


5040 


atctgagtct 


ttaggagatt 


gctttgaatt 


tgtgctattg 


acttatattt 


atatcaaata 


5100 


tgtaaatgtt 


tggtaaaaat 


atcatcatgt 


acattttcat 


aattactcta 


tattcacatg 


5160 


atatatgtca 


gactctggaa 


atatgcatgc 


cacagacacg 


tgtttcttgc 


ctaaaggggc 


5220 


tgatggaaga 


cgcacataca 


aatagacgat 


tgcaatagaa 


tgagagtggt 


ggtctaatcg 


5280 


attcatgccc 


tgatgttgct 


ggacgttgct 


actccaagag 


taacccctgc 


attgtcaggg 


5340 


ttagcatctc 


ctggaagcct 


catgtaaatg 


aagaatttca 


tgctccatcc 


aggacctaat 


5400 


gaataagaat 


ctgcatttta 


gcaagaccct 


catatgattc 


atatacactt 


tttttttttt 


5460 


tttttagatg 


gagtctcact 


cttgtcgccc 


aggctggagt 


gcaatggcat 


gatcttggct 


5520 


cactgcaacc 


tctgcctccc 


gggttcaagt 


gattctcctg 


tctcagcctc 


cctagtagct 


5580 


gggactacag 


gtgcatgcca 


cagtggctgg 


ctaatttttg 


tatttttagt 


agagacaggg 


5640 
















gcttcccaaa 


gtgctgggat 


tacagacatg 


agccaccaca 


cccagcctta 


ttcgtataca 


5760 


catttaattc 


tgagaagcac 


tctatagaaa 


ataagaataa 


gaaaatattg 


ggctcacagg 


5820 


tgacattaat 


aagtaacttt 


atcgagtacc 


ccaaatgtta 


cctatgtttg 


gaagatgggg 


5880 
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ttaaaaagga 


cacattgaaa 


acaagaaact 


cattgtggct 


tttttttcct 


cctttttgaa 


5940 


cagttttcta 


tttctggaat 


gttgtcaatt 


atatctgaaa 


ggagaaatgc 


aacatatctg 


6000 


gtgagttgcc 


cgtttctgtc 


tttgtccatc 


cttgaaaaga 


taagaagaac 


agagttttaa 


6060 


gagtcttaag. 


ggaaacacat 


ctttgtctcc 


tatattactt 


gtgaatgtgg 


atatatgatt 


6120 


ttgtttcaat 


ctattttgtg 


tcctaaggct 


ttttgcaaca 


gaagttggat 


ataatcatta 


6180 


gaaacataaa 


ttgtaccatt 


taacatacaa 


tgaagtttat 


gtttaccttg 


acgtttcttc 


6240 


taaaaaaagt 


gtcctcacac 


cggcattgtc 


cttgtaggca 


tattcacatg 


atcaaataaa 


6300 


taattagttt 


tcaattaagg 


agaatatttg 


aggaaagacc 


gtacgtgttc 


atgtggttcc 


6360 


tgaaggcagt 


ccagtgagaa 


agtaatatat 


gctcattaaa 


caatgcggac 


attttcaggg 


6420 


tttccctttt 


taaccaaaat 


ttggaagcaa 


tgtggaattt 


actggatgca 


tccagccctg 


6480 


aaatgaagat 


aggtttattg 


aatgtgccag 


caagtgcagg 


cccaggtctg 


agtgttcttc 


6540 


attattatca 


ggtgagagga 


agcctgggag 


caaacactgc 


cagcagcata 


gctgggggaa 


6600 


cgggaattac 


catcctgatc 


atcaacctga 


agaagagctt 


ggcctatatc 


cacatccaca 


6660 


gttgccagaa 


attttttgag 


accaagtgct 


ttatggcttc 


cttttccact 


gtatgtattt 


6720 


ttttttgtgt 


gggaagacta 


agattctggg 


tcctaatgta 


agtaagaagc 


cctcttctcc 


6780 


tgttccatga 


acaccatcct 


tttctgtaac 


ttctattaca 


cagtatagtg 


gttctgtaag 


6840 


ttcacacagc 


ccagggagat 


gctggctgcc 


cactcccctc 


aacccaggca 


aattcctcgg 


6900 


ggttaaagtt 


atctactgca 


agtgacgatc 


tctgggtttt 


tctgtgcctg 


tgtttgtgtg 


6960 


tgtgtgtgtg 


tgtgtgtgtg 


tgtgtgtgta 


tgtgtcactt 


taaaaggact 


ggtcagatgg 


7020 


tagggagatg 


aaaacaggag 


atgctataag 


aaaataaact 


tttggggcga 


ataccatgtg 


7080 


actctttttg 


tttgtcattt 


gttgctgttc 


aataggaaat 


tgtagtgatg 


atgctgtttc 


7140 


tcaccattct 


gggacttggt 


agtgctgtgt 


cactcacaat 


ctgtggagct 


ggggaagaac 


7200 


tcaaaggaaa 


caaggtagat 


agaagcccga 


tataaaatct 


tgaatgacag 


gttaacgaat 


7260 


tggagcttta 


ttccttaaaa 


atatggcctg 


ggttttctga 


aacatttctt 


ccagaaaata 


7320 


gtttctccaa 


gttttattac 


tttggtttac 


aaatctcaca 


ttttaaatca 


cattttatac 


7380 


ccataagtag 


cacacatttc 


ataatatatc 


cctctgaatg 


agggttggga 


taataggatc 


7440 


tgtataatgt 


tagaaaatgc 


cttaaagtgt 


gtggagcatg 


agagatggat 


gtgcagaagg 


7500 


cttgtgagga 


aaccacccag 


gtatctggcc 


ttgttttctg 


ccccagaagt 


agccgcctat 


7560 


tcctgtttct 


gttttattcc 


tttgtttctt 


gacttttcct 


ttccaacttg 


ctctaaaacc 


7620 


tcagttttct 


ttcctttctg 


attcatgact 


accaaatgtt 


tccacttgcc 


tcacccgtcc 


7680 
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attacacctt tgataagaac caccagcacc 
gaagaaacat actctctcca tctgtccact 
5 taaaatcact ctcctccagg ctgggcacgg 

gaggctgagg agggcggatc acttgaagtc 
aaaaccaaat cttcaattaa aaccaaatct 

10 

aaacaaaaca acaacaacaa aaacagaaaa 
cctgaggttc cagatacttg ggaggctgaa 
15 aggttgcagt gagccgagat catgccactg 

cccagcacat tgggaggcca aagctgaaga 
ctggccaaca tggtgaaact ccgtctgtac 

20 

gcacacacct gtaatttcag ctacttagga 
gaggcggaag ttgcagtgag ccaagatcgt 
25 tgagattctg tctcaaaaaa aataaaagaa 

ataaataaga cagcagatat actaaggaat 
atttcttttg ggcccacttg gccagctagt 

30 

aatcagagtt taatgacaga gagcgtgaga 
ctcttgagcg agacttctag ggatgggaaa 
35 ccttatcagg ttccagagga tcgtgtttat 

agtgagttgg aagacccagg ggaaatgtct 
cagaacactc tgattcacag ccaaggatcc 

40 

ggaaaaattt ctattctctc cacagcctgc 
agaatatttt gtttctgctg cttctgtcca 
45 gatagactcc tatttttctt gttttatatt 

acatgtagta agcaagattt aactgtttga 
cggtggttga aacttgagtt aaacctttga 

50 

tttttttttt ttagcattct taatagttac 
acccacacag tcacagtcac acacacatat 
55 gcaaatattt taccaccagt caataataca 

tctgtataat tgcctgaatc accagcacat 
aagtaggaag tggggaattt tattaagtta 

60 

aacagggaaa ttataagtgc agagattaac 
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ttgtgctcat 


gtacttgccc 


atgtctgatg 


7740 


ttcctgaggc 


attcaagtct 


agccaccttt 


7800 


tggctcacgc 


ctgtaatctc 


agcactttgt 


7860 


aggagttcaa 


aaccagcctg 


gccaaatggc 


7920 


taaaccaaat 


ctctactaaa 


aaatacaaca 


7980 


ggaaacatta 


gcccagcgtg 


gtggcaggta 


8040 


gcaggagaat 


cgcttgagcc 


caagagatgg 


8100 


caccacagcc 


agggtgacag 


agccatactt 


8160 


ataatttgag 


gtgaggattt 


ggagaccagc 


8220 


taaaaatata 


aaacttagtg 


gggcatgggg 


8280 


ggctgaggca 


ggagaattgc 


ttgaacccgg 


8340 


ggccactgca 


ctccagcctg 


ggtgacatag 


8400 


atttaaaaaa 


tcactctctt 


ccaaagatag 


8460 


aacctcacca 


acttgtcatt 


gactgacatg 


8520 


ctggtttggt 


tttctggaaa 


tgaaagaaat 


8580 


cccagaaaga 


caaaagtaga 


tgaggtaagt 


8640 


tttgtggtga 


ttgatatgaa 


atgatttttc 


8700 


gaagaattaa 


acatatattc 


agctacttac 


8760 


cctcccattg 


atttataaga 


atcacgtgtc 


8820 


agaaggccaa 


ggtcttgtta 


aggggctact 


8880 


tggttttaca 


ttagatttat 


tcgcctgata 


8940 


ccttaatatt 


ctccttctat 


ttgtagatat 


9000 


atgaccacac 


acatctctgc 


tggaaagtca 


9060 


ttataactgt 


gcaaatacag 


aaaaaaccca 


9120 


ccgcttgaaa 


aaatttggtt 


tttagggttt 


9180 


agttgggcat 


gatttgtacc 


atccacccat 


9240 


gtattactta 


cactatatat 


aacttcctat 


9300 


tttttgccaa 


gacatgaaqt 


t ttataaaga 


9360 


tcactgacat 


gatattattt 


gcagattgac 


9420 


ctcgttgtct 


ggggaggtaa 


ataggttaaa 


9480 


atttcacaaa 


tgtttagtga 


aacatttgtg 


9540 
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aaaaaagaag actaaattaa gacctgagct gaaataaagt gagtggaaat ggaaataatg 9600 

gttatatcta aaacatgtag aaaaagagta actggtagat tttgttaaca aattaaagaa 9660 

taaagttaga caagcaactg gttgactaat acattaagcg tttgagtcta agatgaaagg 9720 

agaacactgg ttatgttgat agaatgataa aaagggtcgg gcgcggaggc tcacgcctgt 9780 

aatcccagcc ctttgggagg ccgaggtggg cagatcacga agtcagtagt ttgagaccag 9840 

cctggccaac atagtgaaac cccgtctcta ctaaaaatac aaaaaaaaaa ttagctgggt 9900 

gtggtggcag tcacctgtag tcccagctac ttgggaggct gaggcaggag aatcgcttca 9960 

acctgggagg cggaggttgc agtgagccga gatcgcacca gtgcactcca gccttggtga 10020 

caatgggaga ctccatctca aaaaaaaaaa aaaaaaaaaa aaaaataaaa atttacaaat 10080 

tttaaaatgg ggggaaaatt 10100 

<210> 75 

<211> 22884 

<212> DNA 

<213> homo sapiens 



<220> 

<221> genomic__DNA 

<222> (1) . . (22884) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> initial_coding_region 

<222> (12238) . . (12393) 



<220> 

<221> coding_region 
<222> (13544) . . (13681) 



<220> 

<2 21> coding_region 
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<222> (15338) . . (15394) 
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<220> 

<221> coding_region 

<222> (18027) . . (18188) 

<220> 

<221> coding_region 

<222> (19594) . . (19695) 



<220> 

<221> coding__region 
<222> (20223) . . (20249) 

<400> 75 

tttttgttct gccatagttt actgagaatg atgtttccaa tttcatccat gtccctacaa 60 

aggatatgaa ctcatcattt tttatggctg catagtattc catggtgtat atgtgccaca 120 

ttttcttaat ccagtctatc attgttggac atttgggttg gttccaagtc tttgctattg 180 

tgaatagtgc cgcaataaac atacgtgtgc atgtgtcttt atagcagcat gatttatact 240 

catttgggta tatacccagt aatgggatgg ctgggtcaaa tggtatttct agttctagat 3 00 

ccctgaggaa tcgccacact gacttccaca atggttgaac tagtttacag tcccaccaac 360 

agtgtaaaag tgttcctatt tctccgcatc ctctccagca cctgttgttt cctgactttt 420 

taatgattgc cattctaact ggtgtgagat gatatctcat agtggttttg atttgcattt 480 

ctctgatggc cagtgatgat gagcatttct tcatgtgttt tttggctgca taaatgtctt 540 

cttttgagaa gtgtctgttc atgtccttcg cccacttttt gatggggttg tttgtttttt 600 

tcttgtaaat ttgtttgagt tcattgtaga ttctggatat tagccctttg tcagatgagt 660 

aggttgtgaa aattttctcc cattttgtag gttgcctgtt cactctgacg gtagtttctt 720 

ttgctgtgca gaagctcttt agtttaatga gatcccattt gtcaattttg gcttttgttg 780 

ccattgcttt tggtgtttta gtcatgaagt ccttgcccat gcctatgtcc tgaatggtaa 840 

tgcctaggtt ttcttctagg gtttttatgg ttttaggtct aacgtttaag tctttaatcc 900 

atcttgaatt aatttttgta taaggtgtaa ggaagggatc cagtttcagc tttctacata 960 

tggctagcca ctttccccag caccatttat taaataggga atcctttccc cattgcttgt 1020 
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ttttctcagg tttgtcaaag atcagatagt 
ctgttctgtt ccattgatct atatctctgt 

5 

ctgtagcctt gtagtatagt ttgaagtcag 
ggcttaggat tgacttggtg atgcaagctc 
10 ttttttccaa ttctgtgaag aaagtcattg 

aaattacctt gggcagtatg gccattttca 
gaatgttctt ccatttgttt gtatcctctt 

15 

ccttgaagag gtccttcacg tcccttgtaa 
aagcaattgt gaatgggagt tcactcatga 
20 ataagaatgc ttgtgatttt tgtacattga 

ttatcagctt aaggagattt tgggctgagt 
catctgcaaa cagggacaat ttgacttcct 

25 

tctcctgcct aattgccctg gccagaactt 
gagggcatcc ctgtcttgtg ccagttttca 
30 gtaagatatt ggctgtgggt atgtcataga 

caatacctaa tttattgaga gattttagca 
ttctgcatct gttgagataa tcatgtggtt 

35 

aaaaaaaacc ctgtcaaaaa gtgggcgaag 
atttatgcag ccaaaaaaca catgaaaaaa 
40 caaatcaaaa ccacaatgag ataccatctc 

tcaggaaaca acaggtgctg gagaggatgt 
tgggactgta aactagttca accattgtgg 

45 

gaactagaaa taccatttga cccagccatc 
aaatcatgct gctataaaga cacatgcata 
50 gcaaagactt ggaaccaacc caaatgtcca 

cacatataca ccatggaata ctatgcagcc 
gggacatgga tgaaattgga aatcatcatt 

55 

aaacaccgca tattctcact cataggtggg 
aaggggaact tcacactctg gggactgttg 
60 taggagatat acctaatgct aaatgacgag 
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tgtagatatg 


cggcgttatt 


tctgagggct 


1080 


tttggtacca 


gtaccatgct 


gttttggtta 


1140 


gtagcatgat 


gcctccagct 


ttgttctttt 


1200 


ttttttggtg 


ccatatgaac 


tttaaagtag 


1260 


ataacttgat 


ggggatggca 


ttgaatctat 


1320 


cgatattgat 


tcttcctacc 


catgagcatg 


1380 


ttatttcatt 


gagcagtggt 


ttgtagttct 


1440 


gttggattcc 


taggtatttt 


attctctttg 


1500 


tttggctctc 


tgtttgtctg 


ttattggtgt 


1560 


ttttgtatcc 


tgagactttg 


ctgaagttgc 


1620 


caacagggtt 


ttctagatat 


acaatcatgt 


1680 


cttttcctaa 


ttgaataccc 


tttatgtcct 


1740 


ccaacactat 


gttgaatagg 


agtggtgaga 


1800 


aagggaatgc 


ttccaatttt 


tgcccattca 


1860 


tagctcttat 


tattttgaga 


tatgtcccat 


1920 


tgaaggttgt 


tgaattttgt 


caaaggcctt 


1980 


tttgtctttg 


gttatatcaa 


atttacaaga 


2040 


gacatgaaca 


gacacttctt 


aaaagaagac 


2100 


tgctcaccat 


cactggccat 


cagagaaatg 


2160 


acaccagtta 


gaatggcaat 


cattaaaaag 


2220 


ggagaaatag 


gaacactttt 


acactgttgg 


2280 


aagtcagtgt 


ggcgattcct 


cagggatcta 


2340 


ccattactgg 


gtatataccc 


aaaggactat 


2400 


cgtatgttta 


ttgtggcact 


attcacaata 


2460 


acaatgatag 


actggattaa 


gaaaatgtgg 


2520 


ataaaaaatg 


atgagttcat 


gtcctttgta 


2580 


ctcagtaaac 


tattgcaaga 


acaaaaaacc 


2640 


aatggaacaa 


tgagaacaca 


tggacacagg 


2700 


tggggtgggg 


ggagggggga 


gggatagcat 


2760 


ttaatgggtg 


cagcacacca 


gcatggcaca 


2820 
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tgtatacata tgtaactaat ctgcacattg 
taataataaa ataaaaataa agtctctctt 
5 tagaactcaa tggttacctt tcgtgaggag 

agctgtaata tttcacatga agcccatctt 
aatcccatta aaatggctgt caaaaatgaa 
ttacacccta aaataaagta cataggccac 
aatgcatttg ttttttccta tcttatttcc 
15 tacccagtag ttctacaatc tggcttcaca 

tttgtgttgg gctgagtgca gtggctcatg 
gcgcgtggat tacttgaggc caggagttca 

20 

gtctctacta aaaaaaatag aaaaattaac 
cctactcagg aggctgaggc acaagaatcg 
25 gccgagattg tgccatggca ctccagcctg 

aaaaaaaagt tcaccttcat gttgggttca 
acttcccaga agattccact aggtagttga 

30 

ctaggctatt tccatgttta ccaataaatg 
cctctaaaac aacagtgcct tcattcaaat 
35 cgtttatttg aaaacacagc atgaggtcaa 

tgttggtttt attttatttg atggctcatt 
gtccgtctta atcttttatt aatcttacaa 

40 

ttcaacttgt gacttttgca acattataaa 
gcaccccaaa gtcaaaaacc ttaggtggtt 
45 gatatataaa ggtactgctt catgaagaat 

agtgtgtatg ggggggtaaa acagacagtg 
ccagcaagtc ttgagaaata tattgctttt 

50 

ctagactata ggtgaattca ctcgaagatt 
gttttgaact tgattattct ttaaggtatt 
55 gatcatgtct ccttgccatt gggaaatatt 

ttctattttt ccctttttaa tttgtttaaa 
cagccataaa aaagaataag atccagtcat 

60 

ttctgtcaag tgcaataagc caggcacaga 
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tgcacatgta 


ccctaaaact 


taaagtataa 


2880 


gccataaaaa 


aaaaaaagaa 

mmf 


tgtgttcatg 

mmf --mf mmf 


2940 


aatgcaaatc 

~mf 


aaggcaaaag 


aagattaaag 


3000 


gcactataga 

mmf ~* 


aaatgtactt 


atcaggatga 


3060 


atctgtttaa 

mmf 


ctccaaacta 


agatacaatt 

mmf 


3120 


cctagtagaa 


gaggtgatct 

•mf -mf mmf —J 


ttcttagaga 

•mmf mmf 


3180 


aactatctcg 


ttataaaggt 

mmf mmf 


cttttagaca 

mmf 


3240 


ttagcgttac 

mmf — * 


cttgggacct 

' mmf mmf 


ttaaaaaatg 


3300 


cctgtaatcc 


tagcaccttg 


ggagtttgag 


3360 


agaccagcct 


ggtcaacatg 

mmf -—mr +mf 


gtgaaaccct 


3420 


tgggcatggt 

—S mmf mmf 


ggtgtgcccc 

mmf m^f mmf mmf 


tgtaatccca 

-mf 


3480 


cctgaacctg 

_ * -mf 


ggaggcagag 

■mmf mmf mmf mm* mm* mmf 


gctgcagtga 


3540 


ggcaacagag 

— ' -mf — * — ' 


taagactgtc 

mm* mmf 


tcaaaaaaaa 


3600 


tgacaaatat 

mmf 


cctgatttaa 


tgtcttataa 


3660 


aagtagaact 


gttgtattag 


cattttctat 


3720 


acatgataaa 

•mf 


ttcttccagt 

mmf 


tttcttagga 

mmf mmf 


3780 


gctaagattg 

•mm* mmf >J 


gatttctagt 

mmf mmf 


ccattttcct 


3840 


aacaagaatq 

mmf mm* 


aatcaaaggt 

m-f mmf 


agtttctgat 

mmf mmf 


3900 


atttcaagac 


ctcttggaac 


aacaatcctt 


3960 


ttatagtaac 

mmf 


taatgccatt 


tcaacgtgat 


4020 


ctggtaaatt 

■mm * mmf 


atgaattaaa 

mmf 


cacgtttaat 


4080 


aagtgtttta 

mmJ mmf 


agagcttata 

mmf •mmf 


catgtgttaa 


4140 


gtgattttaa 

mmj 


tatgtacaca 

mmf 


catgggcatg 

mmf mmf m+f mmf 


4200 


caagcaataa 

mmf 


tatttagtaa 


gtagataaga 


4260 


tagtcaaatg 

mmf mm* 


gggcacttgt 


aaatgaatag 


4320 


tccttttcct 


tttggaaatt 

mmf mmf 


tgctacttga 

•J mmf 


4380 


tcagcactga 

mmf MM* 


aagctagtca 

mmf mmf 


gtctctcctt 

*—f 


4440 


gcatgcaagt 


ggaaaaaagg 


tctatgaatc 


4500 


aaatttttta 


atttatttat 


ttttatttat 


4560 


ttgcaacaac 


atgaatggaa 


ctggagatca 


4620 


aaaacaaaca 


tcacatgttc 


tcacttgtgg 


4680 
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gatctaaaaa ccaaaacaat tgaactcatg aagacagaaa gtagaaggat ggttaccaga 4740 

ggctgggaag ggtagtgggg gcctgggagg ggaggtgggg atggttaatg ggtacaaaaa 4 8 00 

aatagaataa ataagaccca ctatttgata gcaaaacagg gtgactgtag tcaataactg 4860 

tacattttta agataaagag tgtaattgga ttgtttgcaa tgcaatggat aactgcttaa 4920 

ggggatggat accccattct tcatgatgtg agtattatgc atttcatgcc tgtatcaaaa 4980 

tgtctcatgt acctcataag tatatacact aggtacttac aaaaatggaa aaaatagggg 5040 

gccgggcgtg gtggctcatg cctgtaatcc cagcactttg ggaggccgag gcaggtggat 5100 

cacgaggtca agagatcgag acaatcctgc ccaacgtggt gaaaccccat ctctactgaa 5160 

aatacaaaaa ttagctgggc gttgtggtgt gtgcctgtag tcccagctac tcgggagtct 5220 

gaggcaggag aattgcttga acccggaggc ggagcttgca gtgagctgag atcgcgccac 52 80 

tgcactccag cctggtgaca gagtgagact ctgtctcaaa taaatacata aaaaataaaa 5340 

ataattttta tgtatttatt ttactgttcc cccctcagct ttaatgaggt acagtcagat 5400 

acatatgttc tgcagatcta atgaacagca tgataactga agttaatact gtactatgta 5460 

cttgaagttt gttaagagag gagatcttaa atgttctcac caccaccacc aaaaaaaaaa 5520 

aaagatgatt atgtgatctg atgtgatcag tagattatct gctttgtgtt agtgacttca 5580 

caatgcacac atgtgtcaaa acatgttgta catggtaagt atattaaatt tttatttgtg 5640 

aatcttcctt tgaaagcagt ttctcacaat cattctctaa ggaattgaag aaagagctca 5700 

atcattttta ttatttactt tgaactcatg tgaagatctt agttattaag acctttacat 5760 

gcattctgtc atttatactt tataccccaa agtgggcact atgatcttca acttacaaat 5820 

gggagaaccg aagtcttaga gtttaagtaa cttcctttca tggctttttc tacagctcca 5880 

aattcaaaaa gagcatgttg aaagcatcca ctctttcctt tcatttcttc agatttcttt 5940 

ctcttcttcg ctgaaggcat tttcttgtca tctctttctt cttcagtatt ccatttgcaa 6000 

tggttctgga taaaacgggg agggaaggat cttatcttta ccaccttcct tttggctatt 6060 

tccttctgac ttctgtctct gtctctgtct tctctttcat cagtcccttc cttcgctcct 6120 

aaatgtagac cctggagagc cgctgcccta gggctcaggt gccagctctc ttagcagaag 6180 

cacctgtgca gaaattttct ggcaactggg cccctgctca tttcatattc acaatctctg 6240 

cccacacact atgttttggt cttggtaact ggccctcttt ctctttcctg tgatcacgtc 6300 

cccactgacg ctcctgtttg taggactagg acctcgcttt ccttgctgtg aacattcttc 6360 

tctgtctcac agactcccct ggggcatgga tcaggggctg tgccctcatt ctttctctca 642 0 

ggcctggccc tgtgaccttc ataaggattg tggtttgaat cttcacgtta cctacttcct 6480 
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tactccttgt cacttggctc tgatctttct 
aagcatataa tttggccact tttgaaactg 
5 gtaatcccag cactttggga ggctgaggca 

ccagcatgac caatacggtg aaaccctgtc 
tggtggtgca tgcctgtagt tgcagctatt 

10 

cccgggaggc ggaggttgca gtaagccaag 
agagtgagac gccatctcaa taaacaaacg 
15 aaaaccaaaa aacaaaaaag aaactgaaag 

aggcacactt cgggcttcac cattgtacaa 
taaattttag gaaagctggg taacatttct 

20 

gttaatagta gtacctgggt cctgggttat 
agcattcaga acaatgacct gtatgtagta 
25 attattatta tatctgtgat attgggcaaa 

gaatgtcatc acgcatataa ggtacatagc 
gcctgagtga ccacctccct cctccagatg 

30 

tgctatgcac ctaacactgt tctacctcca 
tattgttgct tgtgtttcct gttccagcaa 
35 cagttatccc tcacaccagt ttcctaatca 

agcgaggccc caaatgcaga tgtgattagc 
caccaagtcc tctaaagtgt tgcataaggc 

40 

gttatcggga aaagatgctg tagtgatctt 
gacttgggag gaaagccgtc tgccaaagcc 
45 atagacattc ccctttcttt tatatctttc 

atgtctgcaa aggtctctgt ttggttgcta 
atggcttctc tttaggatgt gatgattcct 

50 

ataatacagc aatttctttg ggaaaatgtg 
tcttaggtac attaaatccc actggctgca 
55 ggggtggtct cgatatgcct gaagtatggg 

gggacagtgg agtgtaatga tagcttgagg 
tgtgtgggct tgggaaagct acttaacttc 

60 

ggaaaggatc atagcagtag tacccattca 
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attgcctatt 


ttaggacatt 


tatttaggat 


6540 


aaaggaggcc 


gggcgcgatg 


gctcatgcct 


6600 


ggtgaatcac 


cggaggtcag 


gagttcgaga 


6660 


tctactaaaa 


atacgaaaat 


tagctgggag 


6720 


caggaggctg 


gaacaggaga 


attgcttgaa 


6780 


attgcaccat 


tgcattccag 


cctggatgac 


6840 


aaacaaacaa 


acaaatacaa 


aaaacctaaa 


6900 


gaggggtttt 


aacaatcata 


ttaggataac 


6960 


aacagagatt 


ttgcacaggg 


cttttccagt 


7020 


ataccatagt 


tttctctacc 


ataacatcgg 


7080 


tgtgaggatg 


agatgaatta 


acatatgtaa 


7140 


accatttaat 


acatttaaac 


aataattagc 


7200 


gtaattaacc 


tctctaagca 


tgtaagaatt 


7260 


accgttgctg 


gcacatcatg 


taaaataagt 


7320 


cttctcttac 


agtgtgactt 


tgtctcttct 


7380 


acccacaacc 


tcttcccact 


tgtctaccca 


7440 


tgtgttaagc 


aacatctgaa 


agccagaaga 


7500 


tcactaagcc 


ctggggaaac 


acaatggatc 


7560 


acactggtca 


gtcagcccgc 


aaaacacttt 


7620 


aaagataacg 


ttaagtctgt 


ggaggcttcc 


7680 


ttctgagtgt 


ctcctacttg 


cgacaaggtg 


7740 


tgaagcctcc 


aaggtataga 


ttccctctcc 


7800 


gttaaaatgc 


ctcccagtgg 


actactgggg 


7860 


tgtagaggtt 


tggaacattt 


aactgtgaac 


7920 


ttggagccat 


gataaaaagt 


gttagacata 


7980 


ggtatggata 


cacacacatg 


cacacgcacc 


8040 


cctgagtttg 


tctgccagct 


ctgataatga 


8100 








8160 

-L. w \J 


ttctgatttt 


tagcttctca 


aattgctggc 


8220 


tttgttcctc 


agcttcttca 


tctgttagat 


8280 


tctgatgttg 


tgagaactaa 


attagtaaac 


8340 
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atggataaag 


cccttagaac 


aaatgcctgc 


5 


actggcagtt 


tcctagaaaa 


tctacatttc 




agagagacag 


agtctcgctc 


tgtcacccag 




ctocaaactc 


cacctcataQ 


gttcaaacaa 


10 


gattacaggc 


gcgtgctgtc 


acaaccagct 




ttcaccatgt 


tagcaaggat 


gatctcaatc 


15 


tcccaaagtg 


ctgggattac 


aggcatgagc 




ttaaaaataa 


ataactgatt 


taaccaaatt 




c taaaaat t c 


t ccctcrat ta 


t tt taattat 


20 


agatatttgc 


tctcttcaga 


aatgctcttg 




tgcagaggta 


ttttccatca 


tcatttatca 


25 


tggttactga 


ttgaacatgt 


gtgttaactg 




gcactggagt 


tgcagtgatg 


agtagacaac 




acre t caacrc t 


oat t aaatcrcr 


t atciacraaQQ 


30 


aggggtgeat 


gcttcttcca 


gagttggcca 




atgtgggctg 


aaaatcagtc 


tggaagggee 


35 


cactttggga 


ggctgaggtg 


ggtggatcac 




aacatggtaa 


aaccccatct 


ctactaaaag 




arc fcafcaat c 


c c acre tact t 


cfaaaaQC taa 


40 


gaggttgcag 


tgagctgaga 


tcgggcctct 




ccaactcaaa 


aaaaaaaaaa 


annnnnnnnn 


45 


nnnnnnnnnn 


nnnnnnnnnn 


nttcagtctg 




ttgaggggac 


ttgtctttca 


ggaatggggg 




1 1 tacraactt 


c tcfa.aacra.acf 


acaaat caaa 


50 


ggatgggtga 


ggggtgaggc 


aggagaegta 




cattctccgt 


attgeatgea 


tcttagttca 


55 


ttcccagtgg 


ataaatttga 


agtcttggct 




gataattacg 


ttaggaactt 


attcacatag 




cgctcttggg 


ttctgtacag 


tgttgaacac 


60 


tttatccata 


cttaaataca 


tacagaatgt 



cacatactaa gagctggaga gatgaactta 84 00 

tctgattttg tgtgtgtgtg tgtgtgtgag 8460 

gctggagtgc aatggcacga ttteggctea 852 0 

ttctcctgcc tcagcctcct gagtagctgg 8580 

aattttatgt gtttttagta gaaacagggt 8640 

tcctgacctc gagatcctcc cgcctctgcc 8700 

cactgcaccc ggaccatttc tctgattttt 8760 

ttagaaaaca tagtttgttt cacctggagg 882 0 

ttgaaacaaa tcaagttaaa taatatggtt 8880 

gagtttcttg agatgagagg ctccagtgct 8940 

ggatggctac cttgggctgg accaaaagtc 900 0 

agtgcttgcc atctgatgat tctgagctag 9060 

gaggtaacag acaagtatcc aggcaattaa 9120 

aaagttgagg aaattttgag agcaacccaa 9180 

ggactaaaaa cagcatcctg gaagtatgac 9240 

gggcgcattg actcacgcct gtaatcccag 9300 

ctaggtcaag agttcgagac cagcctggcc 93 60 

tacaaaaatt agctgggtgt ggtggtgcac 942 0 

ggegggatea tagcttgagc ecaggaggeg 94 8 0 

gcactccagc ctgggagaca gagcaaagct 954 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 96 0 0 

gaaggagagc agaaattatc caggtgaagc 9660 

atagcacaga ggattgaaga catttgaggg 972 0 

aaaataaggc tggatggagt tttgagatag 9780 

ggegagagat tatctggtca taaagcagca 9840 

ttagatattt aatccaaaat acacacaaat 9900 

ttttggctat tcttatttat actttttgea 9960 

agtttgcttt aagctttatt gatcttccct 10020 

ggggggttaa gggttccttg tcatttcaca 10080 

aatttaatct tggcatgaaa atcaatattg 10140 
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gtaggaaact ttgatgaaga gtattggtag 
ttatcagtat tggtagagaa ttttatgaat 
5 acatggcttg actctcactt ttagtcaatg 

gtcatacaca cccccagagt agagtatcag 
ttagcttgac aaaaaattaa gttattagaa 

10 

gagtagaaat taagatacta tactttcttt 
gggtacatgt acaggttcgt tatataggta 
15 ttgtcaccca ggtactaagc ctagtactca 

ctctctgata atctccagtt gttttgttcc 
agctcccact tataagtgag aatatgtggt 

20 

taaggataat ggcttctact tcatccatgt 
tatggctgca tagtattcca cagtgtgtat 
25 tgatggacat ttaggttgat tctatgtctt 

tcacgtgcat gtgttttatg gtagaatgat 
gatagttatg gcagaataca gcatcatttt 

30 

tctccctcat cagttataaa agcatataag 
ctatgaatta ggaagttatc ttgcccaaca 
35 tactttagtg atacatattt ttcagctgta 

tatatataca aacttgaagt cctttttgtt 
ccacctgttc acaggttgag aattccgtac 

40 

ataaaattaa aattattccc atttttacag 
catgcaacta gtaagtgatg aaatctggat 
45 tctgaaaact gaggttccca ttactaggtc 

tgaatttgga tcttccatag tgcttaaact 
gtcttctata tagtactata taatgtggtg 

50 

tgtttttagt aaaatctgaa atgatttata 
ctctctatac ctcaattttt ttcatctgtg 
55 attaagtata ttaatacatt cacagcatcc 

taaataaatg acacctacta ttattactgg 
agaccactgc tcagagaaac catatgtctg 

60 

ggactatata tcagtgcatg caaaatgaga 
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caaaatttta 


tgacatttat 


aataaatgtt 


10200 


ttttatagca 


attcccagtg 


taataataaa 


10260 


cttagtaaat 


ttgtcatcac 


acttaaatga 


10320 


aaaggtaaac 


atcctgaaag 


tggcgtgtat 


10380 


aaatggatct 


agtatataat 


ttgtaacaaa 


10440 


ttcaataaaa 


agcttttatt 


ttaggtttac 


10500 


aattcgtgcc 


acaggggtgt 


gttgtgcaga 


10560 


atagttattt 


ttcccgatct 


cctccctcct 


10620 


cctctatgtg 


tccatttgtt 


cttagcattt 


10680 


atttggtttt 


ctgttcctgt 


attagctttc 


10740 


tcctgcaaaa 


gacatgatct 


cattcctttt 


10800 


gtaccacatt 


ttcttcatcc 


agtctacaaa 


10860 


tgctattgtg 


aatagtgctg 


caatgaacat 


10920 


ttatattaag 


atcttattct 


taacccaatc 


10980 


atgtgtttgt 


ttatggttca 


tctcatgcca 


11040 


cattttacaa 


ctattttgtg 

* 


taggagggta 


11100 


ttcagaagtc 


caatgtaagc 


attctgtgtg 


11160 


atgattgagg 


tggttgtttt 


gtgaccctta 


11220 


agaaggcaga 


gtataagtta 


aaagctgata 


11280 


attatcaaag 


tcaactctcg 


caatgttaat 


11340 


aagtgaaaac 


tgaggttccc 


actacaaggt 


11400 


ttgaattgga 


atcttccata 


gtgettaaac 


11460 


atgcaactag 


taaatgatga 


aatctggatt 


11520 


ctaccatgct 


atgttggtac 


tcagaaagac 


11580 


atttgaaaaa 


tattggacct 


tagagtagaa 


11640 


agccatatga 


gtttgggtaa 


gtttctcggt 


11700 


aaacgggaat 


aataataata 


ttataacagg 


11760 


arrapa ri t~ etc* t~ 
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aatgtgacta 


aagecattat 


tcttataaag 


11880 


gttgcccttg 


actgacttga 


agaaaggcag 


11940 


aagaaattgc 


tctttaccat 


tccaaagtta 


12000 
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agtgaagcac 


caaggcctac 


tacaatgacc 


cgatgatgtc 


tactactcca 


aggcaagaca 


12060 


5 


gaattgaata 


ttgtacaaac 


tctgttactc 


tgcctggtgt 


cccatttgtg 


gagagtcatt 


12120 


gtaatgtatg 


gacagaattg 


aacccttctg 


taacactcaa 


aattttagta 


gctagtggaa 


12180 




aaagggaaag 


catgaaggct 


ttggatttct 


tttctatcac 


agecataaac 


aaccccaatg 


12240 


10 


gcctcccacg 


aagttgataa 


tgcagagctg 


gggtcagcct 


ctgcccatgg 


taccccaggc 


12300 




agtgagacgg 


gaccagaaga 


gctgaatact 


tctgtctacc 


accccataaa 


tggatcacca 


12360 


15 


gattatcaga 


aagcaaaatt 


acaagttctt 


gggtaagtca 


gccttagttt 


aaacactgat 


12420 


ttaagaggga 


agaaaataaa 


tacagttgac 


ctgtgaataa 


cgtgcgtttg 


aactgtgagg 


12480 
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20 


ccctgctctc 


cctcttcttc 


ctcagtctat 


tcaacgtgaa 


gaggatgagg 


atgaagacct 


12600 




ttatgataat 


ctacttccac 


ttaatgaata 


gtaaatacat 


tttctctttc 


ttatgatttc 


12660 


25 


cttaacgttt 


ttagtttctc 


tagcttattg 


tatataatat 


gtcacataca 


aaatatgtga 


12720 




taatcgactg 


tttatattaa 


tagtaagact 


tctggtcaac 


agtaggctat 


tagtagttaa 


12780 
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30 


accccaatgt 


tgttcaagag 


ccaactgtat 


catgtagaaa 


accttgetta 


tgctgagaag 


12900 




gaggaaattc 


tgagttattt 


atcactgact 


taataatgat 


tggagaaaag 


acagggactt 


12960 


35 


cttatagatc 


tggggagcag 


aaatgcaaag 


gcattttctc 


atggtagaaa 


acatcaaacc 


13020 




ttggagaaca 


gattccaaga 


cccttcattt 


tttgttttgt 


gecttttgae 


cagecatttg 


13080 




aggaggc uga 
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40 


gaagggttaa 


aaattgggtt 


gagagaaagg 


ctgagtagga 


tatggcttat 


cctctgtctg 


13200 




ccttcatcat 


attatgatgc 


aaaagcatgc 


tgggaagaga 


ctttgactcc 


atctacctta 


13260 


45 


ccaatcatgt 


ctgttttact 


gtcctgccac 


tttctgagag 


gagttcgtga 


gaatgaacag 


13320 




aaatgtgggg 


gagaggtgaa 


gacaaaatgc 


tgccactgga 


attggtgtgg 


atagcgtgga 


13380 




dtaaaaadCL 
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LLL LCdydLd 
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50 


ttcaagcacc 


gacatctcct 


ttctgaggaa 


aattaaaatg 


gcctgttttc 


agtgtccact 


13500 




ttagtatatg 


actgttacgc 


ctttttcccc 


tttcttcttt 


caggccatcc 


agatcctgaa 


13560 


55 


tgcagcaatg 


attctggctt 


tgggtgtctt 


tctgggttcc 


ttgeaatace 


cataccactt 


13620 




ccaaaagcac 


ttctttttct 


tcaccttcta 


cacaggctac 


ccgatttggg 


gtgctgtgtt 


13680 




tgtgagtatt 


cgtactcccc 


tggctatatg 


ttatttctta 


gataccactg 


ctggagatgt 


13740 


60 


gtttgatgac 


aaaaatattg 


tttgtaggtt 


attggaaggt 


tgcatgctgt 


ggttttgatt 


13800 
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tgcaattctt gatataaaca attagagacg 
aattacttaa ttaatttatt ttttgagaca 
5 cagtggcatg atctctgccc actgaaaccc 

cctcagtctc ctgagtggct gggattacag 
tttttgtatt tttagtagag acagggattc 

10 

tgaccacaag tggtcctccc agcttggcct 
cctgatgcca ggccaaaaat ctctttctga 
15 tgaagtccta aaaattatct tatatataac 

ttgaatgaaa aatgggtatt gatgaattgc 
gagttaagaa atttgtctgc tgattactgt 

20 

tgggtcaaca ttcaatcaga agaaataggg 
attatttatt acttccctga tggttcattt 
25 tttgtatatc ttcttttgag aattgtctat 

tgtttgttta tttcttggtg atttgtttgg 
tgttagatgt atagattgtg aagattttct 

30 

tgactgttcc ttttgccgtg caaaagctcc 
ttgattttat tgcatttgtt tttggcttct 
35 cctaagtgaa gtaactcagg agtggaaaat 

gagctaagct atgaggatgc aaaggcataa 
gggaaaggtg ggaaggaggt gagggataaa 

40 

ttgggtgatg ggtgcaccaa aatctcacaa 
aaacaccacc tgttccccaa taacgtatgg 
45 taaaactaat tgcatggaaa gactaaataa 

tgagcagcat caagtacaat tctgtgtcct 
gactattgct ggatatctgc taattgaggg 

50 

tattccttta tctgttttcc ttcctatttt 
tgttgtagca gggataaaac ccacaagaac 
55 gatcagagga gtagaaatat attattcata 

aactgagaca ctgagtatca tttaatgcac 
ggtaggtcag cagatcacat acattgtgag 

60 

ttggtgaact tgaagcttag aaaaagtctt 
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ataaactctt 


ttttattttt 


taatttaatt 


13860 


aagtctcgat 


ctgccatcca 


ggctggagtg 


13920 


tctgcctccc 


cagttcaagt 


gattctcttg 


13980 


gcatgcacca 


ccatgcccac 


ctaatttttg 


14040 


accatgtttt 


ccaggatggt 


ctcgaactcc 


14100 


cccaaaatgc 


tggaattgcc 


aggcatgagc 


14160 


ttaagagaaa 


ccacatttgc 


caagtacaaa 


14220 


tatatgttac 


atttatagat 


acttaaaaaa 


14280 


cttgtggtac 


agcagagagt 


cctggcttga 


14340 


ctgaaagtca 


gaagaatcat 


ttccattatc 


14400 


gtgattttaa 


aatgtattca 


ttcagttttg 


14460 


ttaaaaagtg 


aatgatatgt 


ttgttggcca 


14520 


tcatgtcctt 


agcccatttt 


ttgatgggat 


14580 


gttcattgca 


gattctggat 


attagtcctt 


14640 


cccactctat 


gggttgtctg 


tttactctgc 


14700 


ttagtttaat 


taggtcccag 


ctatttatct 


14760 


tcgtcattaa 


atccttgcat 


aagccattat 


14820 


caaacatcat 


atgttctcac 


ttataagcgg 


14880 


gaatgacaca 


atggactttg 


gggactcagg 


14940 


aggctacaaa 


ttgtgtgcag 


tgtatactgt 


15000 


attaccacta 


aagaacttat 


ccatgtaacc 


15060 


aaataaaaaa 


aatttaaaaa 


ataaaatgaa 


15120 


ataaataaat 


aaataaataa 


ataaataaag 


15180 


aggatgtgaa 


tttaaaaggg 


aggtttgtat 


15240 


aggaccaaga 


agaaaatggt 


tgagacttaa 


15300 


caaacagttc 


tgtagttcag 


gaaccttgtc 


15360 


atgggtaagt 


agcacttcct 


ctttttctat 


15420 


aacaat a era a 


caataaatat 


ccataaatac 


15480 


aaaatattag 


ttttcttaac 


gtgttcagat 


15540 


gatgcattcc 


tgggatgctt 


gctcttaata 


15600 


cttgattttt 


atgtgttgtc 


aagtttgacc 


15660 
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aattctaatt 
aaacttctat 
cactgatact 
ttaccatttt 
atgatttttc 
actactaatg 
aaattttgtg 
tttccccaca 
gtctcactct 
accttccagg 
gtgtgccacc 
tcttgccatg 
gcctcccaaa 
taaacatcca 
ccttgacatt 
aactccttca 
acccacttca 
gatcactgaa 
aagctatgag 
cttggcttcc 
cagtctcctt 
gctgaattct 
attcttatga 
caggtgtctc 
tcttcctcaa 
accaaacacc 
cagcaaataa 
atctttctat 
ttttatcgtt 
caagtatatc 



atttggaagt 
tctaaacaac 
ccactttcct 
attgtatcct 
tgtctcttct 
gactagatct 
attttcttta 
caagtgcatc 
gtcacccaga 
gctcatgcaa 
atccctggct 
ttgcccaggc 
aggctggggt 
gttactgttc 
atttctccct 
aaggaatcat 
ataagacttt 
aacttccacc 
tagcatttag 
agcatacatt 
tcctggttcc 
ttgccacctg 
cttcaagtaa 
tcctgatcat 
accttctcct 
ttagaataac 
tgttggctct 
tgatgacatt 
gaacttctct 
atttatttgc 



aagaactccc 
tgctaagtct 
actccattca 
caaacaccaa 
gagaaaactg 
gtaccaatgt 
aagctgatct 
actttaattt 
tgggagtgca 
tcctcccacc 
aatttttgtc 
tggtctcaaa 
tacaggcgtg 
tcaacttaaa 
gctagggaca 
catgttcttt 
attttattcc 
ctgctaaaac 
catagttagt 
gcttggctgg 
tctccttgtc 
ctcttctcaa 
tatctgtatg 
cagactggca 
catgtcagct 
caagagtctt 

* 

actttcaagc 
tcatccaatc 
ttgccttgct 
tccaaatcct 



accactaact 

ctatgatgtg 

ttcactttcc 

tcccacttct 

aagccattag 

atttttttct 

ccctacttga 

ttgtctctct 

gtggtgtggt 

tcagcctctc 

tttttttttt 

atcttgggct 

agttaccttg 

aacaaaacaa 

ggcacaattt 

ctctgatttc 

ctactatcct 

aacggtcaac 

cgcccactcc 

ttttcctccc 

tctgaccttt 

tctatacgta 

cccacaaccc 

tgtaaagatc 

gatgggaatg 

ctttctttca 

tattattata 

caccttgtct 

agaaacagtg 

tcattggctt 



agtttgacaa 
ttcttagtga 
ctcctccatg 
gtcatcctta 
aagggaattt 
tctagcctct 
gtataagatt 
gctatatata 
cacagctcac 
acatagctgg 
ttttttttgt 
caagcagtcc 
ccccatctct 
aacaaaacaa 
gtattttcct 
catctaccgg 
tctgtaactg 
tcttgttcct 
tccttccagg 
atcttttgct 
taatgttctg 
cttcttctgt 
tgaaataagt 
caaagctcaa 
tcatccttcc 
caatcctgaa 
tccagttttg 
ctcctaattg 
atcccttgaa 
tgtatttttc 



ctgaaagtcc 

gtcacgtagc 

gatagctatt 

ctctctgctg 

tcagacctcc 

taccatagga 

ccattctttt 

tatttttcga 

tacagcctcc 

gactacaggt 

agagacagga 

tcctgcctca 

aatcagcatg 

aacaagaaaa 

tctgtagcta 

tttgaatcaa 

ctgtctttaa 

catctaactt 

aatttcctca 

ttctccatct 

gtgccccaga 

gatctcacac 

atttccagcc 

ctcccagtct 

agattcccag 

tcctatttat 

accttttctt 

ggcttctagc 

accataaagc 

ttagagttag 



15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
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aaacaagttc 


cttttcatga 


tctgcagagc 




cttcttcttc 


atttcctagt 


attgactacg 


5 


cccagaagat 


tctggcctca 


ggattttgag 




gatatcctcc 


tggctggcac 


ccacagctct 


10 


taatatcatg 


aacccccact 


cctctctttg 




tagcccctag 


tacttaatca 


ctttcaaaaa 




ttttttcttt 


gtttttattg 


ctagaatata 


15 


ttctttggtc 


attgatgtta 


gcctcgcatc 




caatcaatgt 


ttgtttcaat 


gaattgttgc 


20 


aattgaaaat 


tcttattctt 


ttttagatac 




ctacaattgc 


actagtgggg 


actgcttttc 




cattaaggag 


ttgtcactct 


tcatcagagt 


25 


tatcaaatgt 


atgtttctga 


aaaatatgta 




tgtacagaaa 


aataatatat 


caggtaccaa 


30 


ttttttgaga 


cggagtctcg 


ctctgtcgcc 




tcactgcaag 


ctccgccctc 


ctgggttcac 




nnnnnnnnnn 


nnnnnntttt 


tttttgtatg 


35 


ccaggatggt 


ctcgatctcc 


tgacctcgtg 




ggattacagg 


catgagccac 


cgcgcccagc 


40 


tcaggtactc 


tacctcttct 


gaactctact 




catttcatcc 


cagtgatcta 


tgaactgtta 




tctttttatt 


attattatta 


tacttgaagt 


45 


tgttacatat 


gtatacatgt 


gtcgtgtttg 




ttaggtattt 


ctgttaatgc 


tatccctccc 


50 


gtgtgatgtt 


ccccgccttg 


cgtccaagtg 




agacactact 


tatggaagtt 


tgtgttctca 




tttatatttg 


tcggagcctg 


atttttctgt 








yy uy uocty 




tttatttttt 


gtttatatta 


gctcagtgtt 


60 


acccctatga 


agtgctaatg 


aaattgtgga 




cctactcctc 


tccttagttt 


actactattc 
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cccacatgat 


ttggctctct 


gttaactttc 


17520 


tgcctccaac 


gcccttgacc 


cctttgctgc 


17580 


tgtttcccta 


atttcaagga 


tattccccca 


17640 


ttcatgcctc 


tccccagtgt 


catttctatt 


17700 


gattcccaga 


ttctccttat 


cccattctgt 


17760 


tattatgtaa 


tttacctata 


gtatgtttac 


17820 


agctccatgg 


agggtggggg 


acagaaatct 


17880 


taggatatca 


gtgggtgtgt 


aaacaatatt 


17940 


caaaggaatg 


aagtacgtgc 


tgcatatatt 


18000 


agaacagttt 


tggaatgaac 


attgccagtg 


18060 


tctcactaaa 


tatagcagtt 


aatatccagt 


18120 


caccggacct 


atgcaattac 


atgggctcca 


18180 


taagtataaa 


atatttcaaa 


caatccagga 


18240 


tgtgccataa 


tttgaatctt 


tttttttttt 


18300 


caggctggag 


tacagtggag 


tgatctcggc 


18360 


agcattctcc 


tgcctcagcc 


tcnnnnnnnn 


18420 


tttagtagag 


acggggtttc 


actgtgttag 


18480 


atccgcccgc 


ctctgcctcc 


caaagtgctg 


18540 


caaccttacc 


aattttttaa 


aaatttgtgt 


18600 


tagaaattaa 


tattttcctt 


tggaattatc 


18660 


attacagact 


cactcctaag 


cttcctgttt 


18720 


tctagggtac 


acgtgcacaa 


cgttcaggtt 


18780 


tttgctgcat 


ctattaacat 


gtcatttaca 


18840 


ccagcccccc 


accccacgac 


aggccctggt 


18900 


ttctcatagt 


tcagttccca 


cctatgggtg 


18960 


ttttctttat 


tacattttta 


atgatctaaa 


19020 


gaatttggag 


gtatttttcc 


tccagggata 


19080 


taat aatacrt 


aaaat cat t a 


a cccaacaat 


1914 0 


ttagtttcta 


gggataatgt 


aaatgcctaa 


19200 


agggccattt 


ttgttcattt 


ttctaaactg 


19260 


agattggaag 


atgttagtga 


taatgtcaga 


19320 
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attttgtttc 
gacacatctt 
ctacagtgtt 
agggagatgt 
actgggtttg 
accttgctgg 
tgtaattcaa 
cctgtagggt 
gtggagttcc 
aatttggatg 
tatacagctg 
cctaggaaaa 
atattttcat 
tggcacagta 
ggatgggagg 
aggaaatttc 
ctgagagcat 
aggagaacat 
ccttggacgt 
gtgggtatgg 
atcattactc 
ttgctttatg 
aaatccaact 
aatgcttctt 
tatcattaaa 
tttcagtgaa 
gagtctgaga 
aactaaagaa 
ttattacctg 
gtaaaatagg 



tacagcttct 
tctcattaga 
atatgtgtct 
tgtagtctgg 
atataaggca 
aattatgcgt 
gagagtgaga 
ttgaagttgt 
tggaattaga 
tgaacatgca 
taattaagtt 
attaataatt 
tctcttctaa 
aacatcaatg 
agaaggagat 
ctcacctccc 
gaatatttga 
aagcctgctc 
ttgactgact 
aaggacagat 
caaagttgtt 
aggtttaagg 
tctgaccgcc 
gactttaaca 
ataaattttc 
ctgctggaac 
gcaagcccaa 
taaggccttt 
tatgaacatg 
tgtaataata 



tccccctgta 
gaaaatgttt 
ctgtctttaa 
tttcctgctg 
tcatatcccc 
aactatctct 
tttttcaaat 
ttctgatgct 
gattgtatga 
agaggaaata 
ttattatttt 
ctttctaaat 
tgtttgttgt 
aatggatttt 
aatgatcctg 
aattctgtgt 
ccttaaatct 
gtaaagctca 
ctatcctttc 
atatttcttt 
tccagaaatt 
aaggaaggcg 
cagtaggaag 
tcagcattat 
actgtatttg 
tcacacatgc 
atgtgttctt 
agaatctgac 
ggcaaattat 
acaactactt 



gctacttcac 
tcatgctttt 
tttatggttc 
acattttttt 
tagggcatgg 
accatagcca 
gattaatcat 
tagaaataaa 
atgttagaga 
ctgtacttga 
cttataactt 
tttagatata 
gagtactgtg 
attgttgatg 
ttcttggaca 
aatcaagaat 
ccagtgactc 
atccttctat 
tctcctaact 
aggcattctt 
ggttctattt 
gtataatccc 
aaaaatgaga 
aaaaagtgtc 
agatgggagg 
cctgatatgt 
caaaggacaa 
acatctgggt 
ctaatctctc 
tgtcggttgc 



atgcgttgac 
agattgagat 
actttgttcc 
ttttacctct 
tgtctctact 
tgtggtgcaa 
tcacatgatc 
cctttatgtg 
tggaatgaag 
tatataagag 
gtaactttct 
ttaactgtaa 
tatataatct 
attatgatga 
ttaatgttgt 
acctccttat 
agagcttcac 
catggcacca 
ataaatccta 
ggatatctgt 
cttcttatcc 
tattcaatat 
cattttttcc 
aaataaaaaa 
gttaaggctc 
aaatgatgat 
tgggaaactg 
tcaaattctg 
tgatctattt 
tctgagggtt 



atagaaatgt 
actatctctt 
actttacggg 
ggatgttgcc 
gctgattctc 
tgcaaactgc 
tcaaagcctc 
gtaatgattt 
ggtagaaaag 
catactttct 
aattgtatga 
aatgaaaata 
ttatggcatt 
taacaggaag 
tttattttct 
gaaaataatt 
ccacaaactc 
atcacaagaa 
tttgtgtgtc 
aacttctatg 
acctactcca 
attttttcta 
attacagaga 
ttaccatcat 
agggatttta 
ttatgttggc 
taaagtagag 
aaactgtcac 
ttcctcatct 
aaatgaaaat 



19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
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aaaaagaaaa 


tgtgaaacag 


caccacaggt 


acttgacaat 


gatgttgctg 


ttgtggttgg 


21180 




tcttgctgtt 


gctggtgaaa 


gctaaggtgg 


gattgtggtc 


agatgtgctg 


gagagataca 


21240 


5 


gtggaaaact 


tacttacact 


tccattgtgg 


ctcctcgtat 


aacctccaac 


aactccaata 


21300 




gccatctttt 


cgaggtggca 


atgtggcgtg 


aaaacttccc 


tcagttcctc 


gtttataaat 


21360 


10 


ctgctaaagc 


acttatattt 


cttattcata 


aatatccaag 


aaaaataaaa 


tttatgaata 


21420 


agaaatataa 


gcacttattg 


tgtccataaa 


tttataccaa 


taatttattc 


ttagttacgt 


21480 




gtatatcgta 


tgtgcaaagg 


ttcttttcta 


acacttgttg 


cctacttata 


taataattat 


21540 


15 


ttctttattt 


tttaatccat 


tctgcagcgg 


gataattaag 


gaaccagaga 


gactgagggg 


21600 




ttgaggagga 


attatttaat 


tatttaggtg 


caccgaccca 


gtcggattaa 


catccaaagg 


21660 


20 


actgagcccc 


aaacaaagag 


tccggttacc 


ttttaagcat 


tttgtgcagc 


gggggacgat 


21720 




ctgtgcaggg 


ggaagcatat 


tacagaagtg 


agaaacaaag 


acagttattc 


aattaagaca 


21780 




tgtattacat 


tatttcttat 


tttttcaagg 


aaaaacatgt 


tttacgactt 


gagtttatct 


21840 


25 


gcctagtgac 


cttgcacctg 


cacagctaga 


gaaatagggt 


cttcacaatg 


cctgggaaag 


21900 




ggagagataa 


ggcttactag 


ccgcagaaaa 


acaggcagtt 


aattttaaag 


gactccagct 


21960 


30 


ctttctcttc 


ttcaggggga 


attgggtttc 


tcttacatac 


aaccgagctt 


ttgcttacac 


22020 




attctttaat 


ttcttttaat 


tcctgttcta 


attcaacaat 


catttaggga 


gtgcctacag 


22080 




tgtgtcacgt 


attgtgctaa 


gagcttggca 


tataatagag 


aatgatgcaa 


aatctctgct 


22140 


35 


cttaagttgc 


ttgtagtggg 


agatacagac 


ccctagaaac 


acacaagggc 


tctcatagga 


22200 




agagtatagg 


aatcttgaga 


agcacgtagg 


acgagattct 


aacccagatt 


agaagacatg 


22260 


40 


tcagtgataa 


ttttttagaa 


cacttttctt 


ctcaactgag 


ttcttaaaga 


taactgagcg 


22320 




ttaacatgat 


ggaaggcaaa 


tggaagaaaa 


gaggatacat 


taaaagcctt 


taagttagaa 


22380 




aaaattttga 


atattcaaag 


aataaactag 


agactttagt 


gggtctaact 


tcagtgtata 


22440 


45 


atgggcagat 


ggaaaaataa 


aagtttggag 


agaggtgtac 


atcacaagag 


attataaaat 


22500 




gacagcactg 


tgagtttttt 


tttatttttt 


ttatttttta 


ttttttattt 


atttattttt 


22560 


50 


ttattgatca 


ttcttgggtg 


tttctcacag 


agggggattt 


ggcagggtca 


taggacaata 


22620 




gtggagggaa 


ggtcagcaga 


taaacaagtg 


aacaaaggtc 


tctggttttc 


ctaggcagag 


22680 




gaccctgcgg 


ccttccgagg 


tgtttgtgtc 


cctgggtact 


tgagattagg 


gagtggtgat 


22740 


55 


gactcttaac 


gagcatgctg 


ccttcaagca 


tctgtttaac 


aaagcacatc 


ttgcaccgcc 


22800 




cttaatccat 


tgaaccctga 


gtggacagag 


cacatgtttc 


agagagcaca 


gggttggggg 


22860 


60 


taaggtcata 


gatcaacagc 


atcc 








22884 



-115 



WO 02/062946 

<210> 76 

<211> 51959 

<212> DNA 

<213> homo sapiens 

<220> 

<221> genomic_DNA 

<222> (1) . . (51959) 

<223> n is an undetermined nucleotide 
<220> 

<221> initial__coding_region 

<222> (29933) . . (30028) 

<220> 

<221> coding_region 

<222> (34943) . . (35011) 

<220> 

<221> coding_region 

<222> (38775) . . (38831) 

<220> 

<221> coding__region 

<222> (40333) . . (40492) 

<220> 

<221> coding_region 

<222> (42979) . . (43079) 

<220> 



PCT/US01/48437 



(dATP , dCTP, dGTP, or dTTP) 



116- 



WO 02/062946 



PCT/US01/48437 



15 



25 



35 



45 



55 



<2 21> coding_region 
<222> (43668) . . (43736) 



<400> 76 

nncaactccc tcaactggaa gttacctgat tttttctcat gattagactg atgtttgggt 60 

10 tttgtggaag gagatcacag atataagagc cagtatcatc acatcccatc aacagcccac 120 

actatcaaca tgacttacca ctgttactct tagtcacttg gttgaggtag tgatgtcagg 180 

cttcttcact gtaaatttac tctttttatt cccctttaca ttctgtattc tttggaagga 240 

agtcactatt cacagcccag acttaagggg tggggagtta cgtttatgaa tccaaagccc 3 00 

gatggcctgt gtgttctttt gccataggct ccactcagtg aactgagagg ttgctgagac 3 60 

20 ctgtgattca aacagcaaat ggagtcaatg gggtccagga gaagagtggg gtgggtggcc 420 

ctttagaggt cctcctgagt gtcctgttgg ttggggtcca actccatggt aacagtcttt 4 80 

atggtaagcc agtagatgag tatattcaag agatctacat caggtattgt aggagtgata 540 

ccagtagcca aataggtcta ccaattgctg gggttttttt tgtggttttt ggtggggagg 600 

gacagagaag ttcaaaatct tgacaatcat ctttgcaatg agaggctttt ctgctttctg 660 

30 aattatgccc aggagtttta ttagtttcat ttggatatta ttaggtcttt ttatcttgtc 720 

tgtgttaatg gcccagctgg gttgctgcat gtggatcaac agggtgtcta gtccttgttg 780 

ggcagtccct tcatctgagc ccaccaggaa gagacaatca atatagcaga agggcagagc 84 0 

tttctccagg agtgagactg tgtgtaactc atggcccatc actccatggc agatacctgg 900 

gagcttaggt ggccttgcct gtgcatgtgc tctgctgtat tggtggtggc tgttctcagg 960 

40 gcactgcaag ggacagtgtc tggagtgaca tggctgcatg gtgcccctct ccctcactaa 1020 

gttctacaag attgccttct ttatcatgct tacctctgaa gtctgcagtg aaatattatt 1080 

aaaccaccca cgcattcgca gtataattac aacttggtaa tgatatatgt tacactgtgg 1140 

gattatattt ccttgtgttt taaatatgat ttttcagaga ctgaaaaata tctgaaaact 1200 

tctccatctc ttagtgtgct ggttgggttt agatcaagat cttttaaaat gaatttgtga 1260 

50 gcattccctt tttaaaaaaa aatcttctga atattttgaa taaaatttcc attgtctatt 1320 

atttttagta tcatataact cacttctcag ggcatggtgt tttatttatt taattattta 1380 

ttcaatttgc atatgtatat tttaaattac ttctaatttt cagggtatta atatttattt 1440 

taaatagtaa gtatagtgta ttcattttta tttagattac taatatatca gattttaaat 1500 

gtataacatc ttatattttc ctttctattc accattattt aattgttctt atttttccaa 1560 

60 tttctttttt tctattagac tgccaaaatt ttgtttaatt tccttcttat tatttgatgg 1620 
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aagacccaag aattgcccta tggattaaac 
aagttgtacc atatttaagt tggcttcgca 
5 aagagagaaa aggcagtatt cctggaaaac 

agaaacacac attactattt aggcagagtt 
taatgaagta ttaagtgcat ggggcttttg 

10 

cacatcagtt agctgtgatc attgttggaa 
tagggcagac aaaggagtga cctgtagtat 
15 cactttcctc ttcccagcaa tctgtgactt 

tgactccgtc cctgcctcca ctgtgctctg 
ccccctttcc actgcaatga ttgaatcagc 

20 

gaatgattct caatcgtctt gttttgaaac 
aaactacagg gtgatatatg gatgtaaggc 
25 ggtgggaaat cagcctaagc atgaagccaa 

cacaaaggta cacaagagtt ggggctggag 
agctccaatc ttatctgaag ttcacctcaa 

30 

ttttttgttt tgactttctc tatatgataa 
gcatttagaa acacaaaaaa tgatttgttt 
35 cagattgtaa tttagataag taaacaatca 

ggatgaaaat tgaggagaga aaaacaaaga 
taggttccca cttcaaacag cttatccagt 

40 

acccccttgc ctttgctgca ccccaacttg 
aaccccagtc cacattcctt tccttattag 
45 aatgacccct cgtctctctc ttctcacctt 

taaatgttta gccagtcacg actagttaga 
aacacacaga agcagaagct gcattagaga 

50 

gtgctctcgc cattgctcca tatgcgagac 
ctgagagacc ctttgtcctt tgtttcagcg 
55 tttccaacaa tggggacagg aatggaaaaa 

ttctcttgac taatatatga aaaacactgg 
cccaccagga atttaggctc cttatgggta 

60 

aaaccgatac taaattgaca aatcttgatg 
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CL CI L- v_- CL <-*.y 


Cl d. w ex. ^ ^ C*. 


t acrcrc c c t c a 


1680 

-*— \J Vp/ Vp* 


ttttctcaat 


tctcatacta 


aatatOQt tcr 


1740 


aaaactaaaa 


ataactaaac 


tcftcaoocca 


1800 


ccctattaaa 


acaaaactcc 


ct t aatat t t 


1860 


atttaacatt 

%p* L— • fc*p ^ 1 Li l— -A. V— ^ 


cac t tcrcrcraa 


acrcrc tcraa tQ 


1920 


aaataaatca 


aaaaaaaQtci 


tecttaqqea 


1980 


cfcacttcrfct~cr 


crtocccaaa 


at ccat cttc 


2040 




a. \— ci CL CA. 


t - 1 r* t" cfacrcrac! 


210 0 

*t-t ^L- \J \S 


cr c t c t acrcrc c 


cacit caatcrt 

w wi *n v*-" ^h*. 23 *^ 


crct 1 1 tcaca 


2160 


ataaacatat 


aacc t t acat 


Qcr t c t cat ca 


2220 


frtaaartat 




ctattcttcc 


2280 


ctaaacttaa 


tataaacatt 


tttqctaaqc 


2340 


w d W. C*. LA CI 


acrcrcj t acf acre 


taa t a cac ta 


2400 


cctccatoac 


at c t aaacat 


c t aaaccaot 


2460 


r , t~r , tcft;3crt~c 


ttcaaaaaca 


ttaacctaca 


2520 


ataatqtact 


cctccctcct 


cccatccttt 


2580 




acra c c a c tz a t 


1 1 c t ccaa 1 1 


2640 

pfp-p j» xy 


aaatccttta 


ctctqaaccc 


tqqaaatQaa 


2700 


aaccaaaaca 


ttcataatat 


taaatacacrt 


2760 


ttccctattc 


tctatcctat 


aat t ccacat 


2820 


trtcraaccta 


ocrc a t crc c t a 


aacttqetae 


2880 


aaaataaatt 


accttcctaa 


tcrt ccccqta 


2 940 


ccttacatat 

V** V-*- x^- W W, V* 1—4. ^ ^— ] x-* 


ctaccttatc 


taaqaaaqtt 


3000 


C tQtOCtCft C 


caaccctaac 


caataqaqqa 


3060 


taataaaacc 


CCtCTCt t tCC 


1 1 tattctcrt 


3120 


acacccttct 


acaaaaataa 


atttcicctta 


3180 

■app* %p* \*r 


ctaattcttt 


tttataacac 


taaacatcrtQ 


3240 


tgagttggag 


agaggcaata 


aaaatctttc 


3300 


caccaccaag 


taaaaaagat 


aagatcccct 


3360 


ctcagacaaa 


cagtttttct 


aagctgtagg 


3420 


aatacctcat 


aggctgtggc 


attcaatget 


3480 
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actattttgg accttacggt gcaaaagaaa 
aagtaattac actgtctact attgctacat 

5 

ataattttcc ctgttgttat tactctcttt 
ttgctacagc aacaattctc tgcttcacac 
10 tatgtgaaca gggctgccat tttgttctca 

caaatgagct accactcaca gttattttat 
tgcagaccta tgcaagagat cagtaatacc 

15 

cagtacatag ggagcatcac gagagcacaa 
ttgggaagag gaggattcat tgaaaattaa 
20 gtgcttgcag ccatatgcat tctaggtagg 

gaaacagttt ggcttactgg ggggaactgc 
gagaagtgac agtttcattt gggtgagagg 

25 

cacaggactt caaatgctta ctggagacat 
acagaaagat tcaagcagca cagtgacaaa 
30 cagtagtgaa tgtaagggta cggaaaggca 

acatgggata gaatattaat attaggcaat 
caaaggaggg catttataat gatatgttgt 

35 

attctactaa taaaaaaatg cctaggagtg 
agaattttta acatatgtaa gttttgagga 
40 aaaatacttg aatgaatcaa cactaaagaa 

ggcaacaatt gaccaaatat tcatcccaaa 
ccacattaac taatcaataa agctagaggt 

45 

gattaatttc aagcccaatg aatatttata 
gcctaaagtg aatatatttt ggtcatacta 
50 cttaagtgga aacatgaaga atttttatga 

tagacagaga cttgactgta ttgacttatt 
ggacgtaata taatatactt tcaaataaat 

55 

tgtccatgtg gaatgagtat gactagccaa 
aaagacagta ctatcagaaa ttaaaatgga 
60 gtgttaaaaa agactgaatg gcaagaaaat 
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tacattaata 


cactgcctaa 


cattcatatc 


3540 


tgtcttattt 


tgctcttgtt 


ccaaattcac 


3600 


tcatgaatga 


agaggttttt 


gaaaacaggt 


3660 


aggagggact 


cctaagtgcc 


agcctctgta 


3720 


ccagtctaaa 


gttatcctag 


gttcctattc 


3780 


tcagccaata 


tttattaagt 


atctggccca 


3840 


tgtagaagat 


aatgtccctg 


cccttgaaca 


3900 


aaagttcaga 


tgtgttggct 


aaaatttccc 


3960 


cattcaaata 


atgctttcaa 


agaagagcag 


4020 


ggaatcagtg 


cgcacagagg 


ctagagccct 


4080 


aaacactttg 


ttatgattct 


gtcacagtgg 


4140 


taagatcctg 


gacgtgcaaa 


atgtggaatt 


4200 


tgggctttgt 


catgcagtca 


atgtgcagac 


4260 


atcccatttc 


tattccaaaa 


gttcctatta 


4320 


aatgaaggtc 


agaaaatcca 


aaacaataag 


4380 


aaaagttcta 


aggaaaattc 


tcaggtatga 


4440 


ttagaatctt 


atgatcaaat 


aacatataat 


4500 


tgagaaacta 


acaaaaacac 


agtactcatg 


4560 


atgtaagcca 


aaaaagatga 


tttgaattac 


4620 


agcatgcttt 


ttaagaaccc 


ttggtacatt 


4680 


acctcactaa 


cttaaaacaa 


aaaatccagc 


4740 


aagtaataca 


aatatttgtt 


accaaaataa 


4800 


gacttttaat 


tgttaataac 


cagattaaat 


4860 


aaacaaaaat 


ataaagtgtt 


cagtaattca 


4920 


ggaaagacat 


acgcctttta 


tttagaaaca 


4980 


tctggttgtc 


agttattcca 


catttttttc 


5040 


ctcttgtgga 


aattgactta 


atgttcttta 


5100 


aagtatttta 


ataaagacta 


atgttgaagt 


5160 


gtatacaata 


actaatacaa 


gtgatgcgtt 


5220 


gagaatagaa 


tgcccacata 


gtaatcatag 


5280 
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aaaaaatata aatggatgaa gtgcaggacc 
catgcactta tgcatgaaaa aatggatttg 
5 gaaaagtgat gtgttttgaa atgcaaaaag 

tgagagcacc acaagaggaa cagttaagat 
acaggacatg caacattgca cttttcacag 

10 

cccaggtact ttagttcagt aagactatgt 
gtggtgaaac tcaagccctg ccgtcttctt 
15 tgcccatgag gcaatttcat ggcatggcat 

gtcaagtgag gaagactgaa gccgaagact 
ttgtccttac ttgtatttta aaaacccagg 

20 

atatctgaca tttggaacct tgtattatta 
gcacaggctc gggactgatc ctgtaacaaa 
25 tctcagccaa atggctcagg gttgacccta 

gctgcctacc atgccctgaa taaccagtta 
cagggggtgt taatgtcatc ctttgtggag 

30 

ttttcatcaa gaatttagtt gctgatccta 
atttataaaa cgagagtgtc cagattgaag 
35 tggaatacag atcaaattta atcaggtaaa 

aatcacctgc acaaacctga gatgggacta 
aggcctgggc ccttctacta gttagctgaa 

40 

agaagtctgc ctttgcctct agagcaatgg 
ttgggaaact ttaacacgta ctaacacatg 
45 aatatggatt atggttggag agtcaaaatt 

tagtcaatgt tgccaattac tgctataagc 
aatctggagg gcttcttaaa acacagaatg 

50 

ggggtagggt atttgcatgt caaccacaat 
aaccacattt taagagctac tgctctagac 
55 gctttttaaa ttctaccgct ggtattgagc 

tcttctcttc tgagtaggat ttattccata 
ctgtgtctgg agggtagggc caagttggac 

60 

taaggacaga gagtaggatg tgatgtcttg 
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agacaattta 


actaaagaaa 


tagaaatggc 


5340 


aaagcaaata 


atttgtataa 


tgaaataaac 


5400 


aaaaaaaaag 


tggttggtgg 


agaaaagctc 


5460 


atgaaagtcc 


ctggaaaaga 


ggaagctggg 


5520 


gatcagcacc 


aagagcttgg 


agttcaagtt 


5580 


gatagctgtt 


atgcggtcat 


ctgggggctg 


5640 


attgtgttta 


gtcggaatgt 


acaacaatta 


5700 


tggtatattt 


gcttcagggc 


tagagtgaca 


5760 


gaacatcaga 


gttatacatc 


ttgacatcct 


5820 


cccagataac 


tccaacactc 


ggtttctcct 


5880 


gttcagaaga 


agtactgaag 


aggcctcagt 


5940 


ggtgatcaga 


tgacacccag 


gcccagctgg 


6000 


aatatggaaa 


accagagaat 


caggactatg 


6060 


tatggattgt 


catctggtag 


aaatcctatg 


6120 


tctcttggtt 


ggatcttcaa 


ccacttaata 


6180 


tttgcaaatg 


atccaaaaac 


aagaagtata 


6240 


atacaaccca 


aaagtctgga 


acatgagcgt 


6300 


attaaagtcc 


caagcttatg 


tttaaaaaca 


6360 


gaacttgctt 


aacagaagtt 


aatgtgagaa 


6420 


aggatggaga 


cagaagcaaa 


agactctcca 


6480 


atctcaatct 


tgtctgagca 


ttgggataac 


6540 


agccccatta 


ccaggtattc 


tgatttaatt 


6600 


ttttgtagct 


ctttagcgga 


ttctaccgta 


6660 


ctgtagatat 


caaacttgag 


tgtgtatcaa 


6720 


cacaccgaga 


ctgtctgatt 


caataggtct 


6780 


gtcaggtgat 


actgatgcta 


ctactctggg 


6840 


gaacctggag 


ttctccagat 


gcaactaaag 


6900 


uay act uy lcil 


cx y ct cl k i^- Lyy l 




O 27 O \J 


agctttggcc 


ttctctttac 


atcttcatat 


7020 


ttcgggatac 


taaactgagg 


ccttttattg 


7080 


atgctgcttc 


aggttactag 


aggagacaat 


7140 
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gatctcagtt 


cagaaccaag 


cttaaagtta 


5 


atgcaagcta 


ggatttactt 


ttcaatcttt 




tttgggaaga 


aattagggta 


aatgagcaat 




tctcctaaca 


ctcataatat 


qqcc tat tea 


10 


tgcacaattt 


tcaggagtcc 


agaaatacaa 




caattgttgt 


catctctttc 


cacacacaca 


15 


caggcagttc 


ttacaatatt 


gagatgettt 




atttacatct 


tataaaaata 


gaacatcaga 




taaaatataa 


aaaatctata 


acccaaaaaa 


20 


atgaatgtat 


aacccagttt 


ttctgttttt 




tgtcatattt 


aattgagcca 


caagagggag 


25 


tgaattccta 


cctaatctgg 


tgtcttctgg 




ctggtcttgt 


atactagcgc 


tgggttgggt 




taataoatac 


aacaaatcat 


ccctctaacc 


30 


gtattattta 


gatattatta 


cttatattga 


• 


ttttcaaaac 


atcacaagtt 


gattttcaag 


35 


tgctttttgt 


accatcctaa 


ctttatacca 




cactctggtg 


aactgaaaaa 


catattttag 




cctctccctt 


c t cc tat c t t 


t ccccacat t 


40 


gagggtgtgg 


atacagctag 


agggagaggt 




tctgaaaaat 


ctgggttata 


aatggcacac 


45 


ttttggaaca 


tggatcttat 


tgcagaaggt 




ccaggtggca 


acatctgtgc 


ctggcttctt 




cataaaaaaa 


at aacc toaa 


ccaacccaac 


50 


atagatcaca 


cattaaggaa 


atatgtccat 




atggatctag 


aagtgacttt 


gaaagtgaat 


55 


actgatggaa 


atgaaccaag 


catttatttt 




gggtggagag 


aggattaaaa 


gaagtgaaat 




acccattccc 


atcatggttg 


ggccagtacc 


60 


ccctttattt 


tacacagtat 


agaaattcac 
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tatatctctc 


ggggactatt 


tgtcccagga 


7200 


ccccagtttc 


tttcccaaaa 


gtaacatatc 


7260 


ggtaagctca 


aatcctggca 


tcaggaaaat 


7320 


ccactgacca 


caaagacatg 


tagaaggtgg 


7380 


acagttctct 


taatacttgc 


tctgtagtct 


7440 


tgeatgeaca 


cacacgcaca 


cacatgcaca 


7500 


gcttagactg 


ggctgtatat 


aagagacctg 


7560 


gttagaagag 


acgtggaaat 


ctttctacac 


7620 


agaaagtgat 


tcacgtgagt 


catgaaggta 


7680 


cctattatcc 


ccttcatttt 


gacattgagg 


7740 


gcagtggaca 


agaaattgtc 


ctacctaatt 


7800 


gaagacctta 


tgaaaagtca 


actcccagct 


7860 


cccatttcta 


tcccatattc 


tctgggaatc 


7920 


cttagctttt 


teatctgeta 


aattttgtag 


7980 


aaggtgttag 


tgaggatacc 


tgagaatgaa 


8040 


atagtattaa 


agtgctttaa 


tttcaaaggg 


8100 


cttttttgtc 


cactcccacc 


ccattctctg 


8160 


attttatcta 


attgttattt 


actctactct 


8220 


ctgtaaaatg 


tgagcacatg 


catgatggag 


8280 


tgagactgea 


catctcacct 


ggtctcatat 


8340 


cagagacatt 


accatgccct 


cctctctgct 


8400 


gagggtttgg 


gaaggcttcg 


ggatgagtgg 


8460 


cccttcttgg 


tggaacagee 


actggcctct 


8520 


cctgagagcc 


tccctcatgg 


tgttggactt 


8580 


gagagatttt 


aggatgactc 


atatggtttc 


8640 


gaaccaacct 


agtagtttct 


agtgaagctg 


8700 


ccaatgccat 


agagtggatc 


ggtgaaagct 


8760 


aaagttagga 


cttctgaacg 


tttctctaat 


8820 


atttttggag 


tttttgtttt 


gtttctcttt 


8880 


tgactactcc 


tcagattttt 


ccagtcctgc 


8940 
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ctccccagag ggactcacat cccttttcat 
ctcctaggat tcccactcgt tactcaaaat 
5 ttaggtgaat aaaatttaat ctaataatat 

tgagataatg agtggaacca cgcatttatt 
agttatctgg accaacaagt ggtaagagca 

10 

ctgaggaaac agtacgtctg gagacacatt 
gttgtctgag atttctgggg ggctggatta 
15 ttttgagtga ggtaggaggc attggaagat 

gcatcttaac aggaccactc tggctgcctg 
gggaagatgt ggggagagta gcgaggaggc 

20 

ggttgtgttg ttggaacaca gcatcactgc 
gcctctcttt ctttcactcc ccacattcag 
25 tgtccccaag aaacataatt catcccatgt 

cgcagaattt gggcagagtt aagggaacga 
aaagcaggaa ggtatgagaa gagaacacac 

30 

ttgtaagaag agctgctttg tagtataact 
gtcaatgcag agagaatagg aaagcaatgc 
35 tctccaaaca aacctgaatg gtttgcttta 

attttcccag agtgacacaa ttggcaaatg 
tttttagccc tgtggtcttt ctattacact 

40 

aatgagactt ggctttggct cacagagaca 
gtgagaagat atattgtcgg ctcattccct 
45 gcattttttt tttttggaat gaaatatatt 

cctgatgttt tctgtagctt tcttttttta 
atagttttta aataaataat ctgtacacca 

50 

gtgtaggtac caacctccca ggattttagt 
cccaccatcc tctctcttgg actactgtcg 
55 cctcatctcc cttcagtctg ttcttatcat 

actagtcaga tcctgtgttc cttttgctga 
ataaacctaa gtccttaata agacctcaag 

60 

tcagacaaca ttgccttctc ttctccctat 
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ggcagtaacc 


acctggtgtt 


gctcgagcac 


9000 


ggataataaa 


gaaattttct 


caaacgcaat 


9060 


gtgtgaacta 


ttcttctgaa 


ataataatgc 


9120 


ttcttgaacc 


tgataataca 


aatccctagc 


9180 


gaggtcttga 


ggtgagcacg 


ttccatgctt 


9240 


gagactggaa 


tagggagagg 


agaaggcaat 


9300 


taaagcgact 


tattaaggag 


atacggctta 


9360 


tttcagcaaa 


gggaagacag 


gatctaactt 


9420 


tcttggtaag 


aaccgactga 


agggaggcga 


9480 


tccggcagta 


gtcccagaga 


gaggatggtg 


9540 


tcattcaaat 


cctaggaggt 


gggtgcctac 


9600 


tcattttgaa 


ggctcaggta 


tcagtaaaag 


9660 


gacttattta 


agggattttt 


atgaagacct 


9720 


agaaggaaag 


ggaggtcact 


acagacaaga 


9780 


tattatgggg 


accaataaag 


agcagaagtc 


9840 


gtggaagaag 


tagctacttc 


caaaggctgt 


9900 


tttgtccttt 


ctctctttcc 


agcctctaat 


9960 


ctgtcacaca 


tttatagggg 


atttgtggag 


10020 


gctgaattgt 


ccggagctca 


gacctgctgc 


10080 


acctggtccc 


cctccagaga 


gaaacaacta 


10140 


atgaataggg 


aaagttggaa 


gtctaagtgg 


10200 


tttttgccct 


cccaagagaa 


aaaagagctg 


10260 


ttctttttct 


gatagtgttc 


cactgctctt 


10320 


aaacttttat 


tttatgttca 


gggtacatgt 


10380 


aacccttgtg 


acatgcagtt 


tgaaagacaa 


10440 


gagaagcaac 


gctgcatccc 


aacaacccaa 


10500 


gagcctcctc 


actatcctcc 


ctacgtcttc 


10560 


QQcaacrcaQC 


caaacrtciata 


c tcrt taaqac 


10620 


aaatcttcca 


gtgcctctat 


ttcactcaga 


10680 


ttcttttata 


attgaggccc 


cgcagacacc 


10740 


cccaatctca 


atgtgtctcc 


agacacactg 


10800 



-122 



WO 02/062946 



PCT/US01/48437 





tcccctcaga 


agcacaggaa 


cctgctcccc 


5 


ttcaaataac 


tgctagtgag 


ctctaaggat 


ttacatgcca 


cagaggtttg 


gtgtaccgat 




* — • w Vw CI GL \ — ■ CI. VwJ CI O 


aattttttaa 

Cl V— I — w w w v_- CL d 


l"ccttaccct 

V— Vw V«* l_ C*. Vw Vw \_< l_ 


10 


agtgtctgtt 


tttcccttct 


ttgtgtccat 




gtgagaatgt 


gtgatattta 


gttttctgtt 


15 


ccagctccat 


ccgtgtttct 


tcaaagaaca 




ttcccatggt 


gtatacgtac 


cgttttttcc 




ttcatatctc 


t err' t~ a has pi c 1 


at a fat - aaa c 


20 


cctttgggta 


tatacctaat 


aatgggatag 




ctttgagaaa 


ttgccacact 


actttgeaca 


25 


gtgtatggtg 


ttatttttcc 


tccacaacct 




ataatagcca 


ctctgactgg 


catgagatgg 






3 1" f saf 1"cr 

Cc w L, C^v— ) V— *wj u k— »wj 


Vw V— VwJ CIVJ WL >-J w w V — 


30 


cttcttttaa 


aaagtgtctg 


ttcatgtcct 




ttcacttgtt 


aatttaattt 


ccttatagat 


35 


gtttgcacat 


actttctccc 


attctgtagg 




tgctgtggag 


aagctcttta 


attaggtccc 




ttartft taa 

l-> V— ■ VwJ Vw w v— w ~H CI 


rat rt*t*faar 

Vw C*. w Vw L U U VwJ C_l V_, 


r5 t" Cfr3 3 r3 t" t t~ 
& lw Vwj CI C* CI w Vw vw w 


40 


ctaggttatt 


tttcatggtt 


tttataggct 




attagttgat 


ttttgtatat 


agtgtaagtt 


45 


ctagccagtt 


atcccattac 


catttattaa 




gctgactttg 


ttaagtatca 


gatggttgta 




t*r*t"crt"tT , r i at" 

V— v_, v-. v-j l_ v„ Vw *w CI 1— 


t" oat" & t* cit" 
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atctattttt 

V»j l_ Vw w VwJ w lar \— Iw 1— 


50 


agctttgtaa 


tatagtttga 


agtcaggtac 




taggatttcc 


atggctattc 


aggctctttt 


55 


tttctaattc 


tgtgaagaat 


gtcattggta 




ttgctgttag 


cacaatgact 


attttaataa 




tgctttttta 


tttgtctgtg 


tcatctctga 


60 


tgtagagatc 


tttcacctct 


ctcattaact 



tcaagagctt 


tgcactttcc 


acttctttgg 


10860 


ttgcattatg 


aggttcaaga 


aaataaatgc 


10920 


tattttatca 


cccatgtaat 


aagcatagta 


10980 


cttctcaccc 


tctaccctcc 


agtaggaccc 


11040 


gtgtactcag 


tgtttagcta 


ttactcataa 


11100 


tetgeatgag 


tgtgcttagg 


ataaaggaat 


11160 


tgaactcatt 


ctttcttatg 


gctgtgtagt 


11220 


agtctaccat 


tgatgggcat 


tcagtatgat 


11280 


atgtatcttt 


atggtaaaat 


aatttatgtt 


11340 


ctgggtcgaa 


tggaacttct 


gttttaagtt 


11400 


aggctgaact 


aatttacatt 


cccacaagca 


11460 


ttttagcatc 


tgttactttt 


tgacatttta 


11520 


ccacagtctc 


cttgtagttt 


tgatttgtga 


11580 


tttaagtata 


cttgttggct 


gtgtgtatgt 


11640 


ttgcccacct 


tttaatgggg 


ttgattgttt 


11700 


tcttgatatt 


agacctttgt 


cagatgeata 


11760 


ctgtctgttt 


actctgttga 


tagtttcttt 


11820 


acttgtccat 


tttttttttt 


ttttgtgcaa 


11880 


ttccagggcc 


tatgtccaga 


atggtatttc 


11940 


tagattttac 


atttaaatct 


ttaatccatc 


12000 


agggatccag 


cttcaatctt 


ctgtagatgg 


12060 


atagggatcc 


tttccacatt 


gcttgttttt 


12120 


ggcatgtcac 


tttctttctg 


ggatctttat 


12180 


gtaccattac 


catgctgttt 


tggttacagt 


12240 


catgatgect 


ccagctttgt 


tetttttget 


12300 


cagttccata 


tgaattataa 


aatagtttat 


12360 


gtttgatagg 


aacagcattg 


aatctgtaga 


12420 


tattgattct 


tcctatccat 


gagcatggaa 


12480 


ttactttgaa 


cactgttttc 


taatcctcat 


12540 


aaattcctag 


atattttatt 


cttttttttt 


12600 
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ttttggccta ttgtgaatga tttggctctc 
tactgattta tatacattaa ttttatatcc 
5 taagaggttt tgggcagact atggggtttt 

gagatagttt gacttcttct tttcctattt 
ttgctttgtt tagaacttcc agtactatga 

10 

ttgtttcagt catccaggag aactttccag 
tggtttgcca tagatggctc ttattatttt 
15 agtttgttga gggtttctta acatgaagac 

atctattgag atgatcatgt ttttagttct 
accttacgtt aaaccaacct tacatccaag 

20 

gattttttat gtgctgctgg atttggtttg 
ctgttcatca aggatagtgg cctaaagttt 
25 gtgtgtgtgt ctgccagatt ttggtatcag 

gaagatcctt tcctcctgaa tttttggaat 
tatacatttg gtagaattgg gctatgaatg 

30 

ggctttttgt tgtgatttaa tttcagaact 
tcactggttt actcttggga ggttgtatgt 
35 ttctagcttg tgttcataga ggtgttcata 

gggattagtg ataatgttgc cattttcatt 
ttttctttat tagtctagct actggtctat 

40 

tcctggattt tttaatcttt ttatggtttt 
ggtactggtt acttcttgtc ttctgctagc 
45 gtcctctgtg tgtgatgtta ggttattcat 

atttagcact atacattttc ctcttaatgc 
tgttatatct ttgctctcat tagtttcaaa 

50 

gtttacccaa cagtcattca ggaacagatt 
aaagatattc ttggtattta tttcagtttt 
55 tatgattttt tttaactgct gaggattgtg 

tatgtaccat gtgcacatga gaagaatgtt 
tagtagtctg ctaaacccat ttggtgaaat 

60 

gttttctgcc tctatgatct gtctaatact 
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agcttggatg 


ctgttggtgt 


ataaaaatgc 


12660 


tgaaactttg 


ctgaagttgt 


ctatcagatg 


12720 


ccaggtatag 


aattatatca 


tctgcaatca 


12780 


ggatgcattt 


tagttctttc 


tcttgcctga 


12840 


aaaggagtgg 


tgagagtggg 


catccttgtc 


12900 


tttttgccca 


ttcaatatgg 


tgttggctgt 


12960 


attattttga 


ggtatattcc 


ttcaatgcct 


13020 


atgtagaatt 


ttatcaaaag 


ccctttccgc 


13080 


gtttacgtgg 


tgaattacat 


ttattgactc 


13140 


agataaagcc 


tacttgatta 


tggtggatta 


13200 


ctagtatttt 


ttttgagtgt 


ttttgaacct 


13260 


tcgtgtgtgt 


gtgtgtgtgt 


gtgtgtgtgt 


13320 


aatgatgttg 


gcctcataga 


atgagttaga 


13380 


agtttcagta 


ggacggtgtc 


aactcttctt 


13440 


catctggtcc 


tgagcttttt 


ttagttggta 


13500 


tgttattggt 


ttgttcaagg 


tttcagtttc 


13560 


ttccaggaat 


ttatctattt 


cttctatgtt 


13620 


gttgtctctg 


aacatttttt 


gtatttctgt 


13680 


tctgactgtg 


tttatgtggg 


ttttctcttt 


13740 


caatcttatt 


tattctttcg 


aggaatctcc 


13800 


tcacatttca 


atattgttta 


gtttagctct 


13860 


tttggggttt 


gttagctctt 


gtttctgtag 


13920 


ttgagatctt 


tctaactttt 


tgatttgggc 


13980 


tgctctagct 


atgtcccaga 


gattctcgta 


14040 


gaatttcttg 


atttctgact 


gaatttcatt 


14100 


gtttaatttc 


catttaacta 


tgtagttttg 


14160 


tattgcactg 


tggtctgaca 


gtgtgtttgg 


14220 


t" t" Pi t" Ci C C C~*C1CI 


t'tar'stcicrt't" 


C*. O U U L- CL ^-H 


14 2 8 0 


tactctgctg 


tttgggggtg 


gagagttctg 


14340 


gttgagttca 


gatcctgaac 


atctttatta 


14400 


accaatgggg 


tgtcaaaatc 


tcctgctatt 


14460 
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atcatgtggt tatctaaatc tcttcatatg 
gctctggggt ttgatgcata tgtatttatg 

5 

ttaccattat ataatgccct tctttatctt 
gaaattagaa tagcaactgc ttcatttttc 
10 atccct ttac tttaagttga tgggtgtcat 

atatagttgg gtcttgcctc tttatccaat 
ttatcccatt catattcaag gttaatattg 

15 

tgtcagttgg ttattatgca gacttgaatt 
aacagagctt cccagcttac caggagcaga 
20 atgaactgct ggatgttcaa tgtggactaa 

cctggggaga ggtcaaacac ttgcatggct 
gtgtaaagaa cagagaaaaa tcttgtgctt 

25 

taaatactcc agggcattct gttgtcttta 
ttttttcaga ttctgattga ttctgattga 
30 ggaaaaggtc aatacacaac tccaccttcc 

tcagctacat ctctctctcg tatcctgcct 
tttgtgaagt tgataactga gaagcacaag 

35 

ctagaggatg tttcccctca ctctacacta 
acaggagatt atagtgaatg aactgtaaac 
40 ggagatccaa taataatagg ggagaccaaa 

gacccaaata gctacaacaa acaataaatg 
accttacata aaaggcctat ttactttact 

45 

ttacaatgta tgttaaaagg taaaagacaa 
aagattcaaa tgcggcagag attttggaat 
50 ttaatattat aaggacttta acaggaaaag 

taagcagaga gaagaaattt ctaggaaaga 
cagtaataaa aattttaaaa atgtttttga 

55 

ggaaagagtc agtgagcttg aagatacgtc 
ggaaaaaaaa tggaaaaaaa aagaacagaa 
60 gaataacatt acacataatg gaaataccat 
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tatctaagaa 


tttcttttgt 


gaatctgggt 


14520 


acagttaggt 


cttcttgttg 


aattgaacct 


14580 


ttaaaatcat 


tgttaaaatc 


tgtttggttt 


14640 


tgttttcttt 


tgcttggtag 


atttttctcc 


14700 


ttcttgtgaa 


atggatcttc 


tgtggacaac 


14760 


ttgccacttt 


gtgcctttta 


attggtgcat 


14820 


atatgtgcag 


gtttgatctt 


gacatcatgt 


14880 


tttgtcccga 


aatttggaca 


ggaaggccca 


14940 


agcacaatag 


gaacacatca 


attataaccg 


15000 


cttgggagct 


gaaactccat 


gaggcccagt 


15060 


tttacctcaa 


agtgtcctag 


caggtattta 


15120 


ctgattaagg 


aaggtaatat 


aaccattttt 


15180 


acaaaaaaag 


tcttccctca 


agggaaacta 


15240 


tttggataat 


accaaaacct 


aaccagtctt 


15300 


tctatccttc 


cacgtgggga 


aactgaatat 


15360 


gacttaagtt 


caggtagggg 


ctaagaagca 


15420 


ttgctaaaag 


gctgagacct 


aattatgata 


15480 


ccatcagatg 


aatcaggatc 


ctgtatgata 


15540 


ctcagaccct 


atttaagaag 


tggtgtctag 


15600 


acaaggacat 


tacaggacat 


ttcagccttt 


15660 


cagcctaact 


tctagccaga 


taaacataat 


15720 


tccttttatg 


tccagctatc 


aaaaaaaaaa 


15780 


agtgtgaaga 


aacaaagcaa 


gcaacagaac 


15840 


tatcggatca 


gaaatgtaaa 


ataactatga 


15900 


catacaacat 


acatgaataa 


acagatatga 


15960 


atgaaaaggt 


aatgctagaa 


atgtaaaata 


16020 


taatttcatc 


agtatactag 


gcatgtcaga 


16080 


aataaaaact 


tccgaaactg 


aagagcaaag 


16140 


caaccccaaa 


gtgcaggaca 


actacaaaaa 


16200 


aaaaagaaac 


aagaaagaaa 


ggaagagaag 


16260 
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aaatatttga agtttaaata atgactgaga 
cacaaatcca ggaagctcag aaaataccat 
5 ggtatataat attcaaaaac agaaaatcaa 

aggagaaaag gaaacttaac tagagaggaa 
cagaaactat gcaaacaaga agggggtgaa 

10 

gttcaggcac cagtcacact tccttgttac 
cagccctgga gttctgccta tgtgtgttgc 
15 ccattttaag agtgccagca tgttccatag 

tgaattaata tcctcatatt ggaagaaaaa 
acatggtagg cagtatttat ggcaagggtg 

20 

tgagataatc agatgggaag aacgaaccag 
gtccttagaa cgtacctatt ctaactctga 
25 cattgagttt agtttctgag ttgagattaa 

ccttactgtc ttcagccact ttctttctca 
atgtgaggag acttcatgta ttctctgtgt 

30 

cgtgctccct cttagctctt aatcttatgg 
aagagccata atcttgttga atcaaagtcc 
35 tttaattcat ccactgtttc attaaagcaa 

tttgcttgga tttctgaata tagtacaggt 
taattcaaag gagagactgt aaaaaatata 

40 

actggcattt ttttagcttt ttgtggaatt 
aatccggtgt attagaaaca tgactttagg 
45 tacttgccag agatagttgg tgattattat 

gagattagac atctctataa ttgtgggcct 
aaagcttttg tttcctttga taacttatat 

50 

gtaagtcatt ttctaagagt ttcagattaa 
tgcagagcag taaattggtc agagctaaga 
55 tgatttgtat acactacttt ggtgtcagca 

accctgtcct taggggacac tggcatttat 
tctgaggtta agtgttaaga attttaagag 

60 

tacatatgta ctatttgaag aaccatttct 
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tttttccaaa 


attagtaaca 


gatgccaaat 


16320 


gcatagtaaa 


gagtaaaaca 


tctatacata 


16380 


agacagagaa 


aattttgaaa 


gaagcaataa 


16440 


caaggataag 


tattacatta 


gtcttctttt 


16500 


atgaagtatt 


taaaatgttg 


gaggttaaag 


16560 


agtagacact 


ggttgagatg 


ttaggtcttt 


16620 


tatgttgtac 


acaggtcaac 


tcaaactctt 


16680 


agctcagtat 


atctcaaggc 


ctgggatttg 


16740 


tactcatgaa 


atgtggtggt 


gatgagatac 


16800 


gtgcttaaaa 


aagctggttc 


atattggcaa 


16860 


gtgtagaatc 


actggagatc 


aggccggctg 


16920 


ggatgaagaa 


acaaatatta 


ctctacatcg 


16980 


aacttggatc 


atctcactcc 


cagcgatgta 


17040 


gacattcttc 


acagtaaata 


tttcttgcac 


17100 


cctggttttc 


ccttagtatc 


attgatcatc 


17160 


ttttaatact 


tttttcctag 


aaaccataac 


17220 


gttggcctat 


caaaatgttt 


ataacatagt 


17280 


attataatga 


gtgactgctg 


gagaatagta 


17340 


agcctggaat 


agctaaggaa 


tctgtgttca 


17400 


tcaaaacact 


aaagctggtt 


ttgaataact 


17460 


tattttaact 


gcctcaaatt 


cacaagataa 


17520 


acacttgatg 


aaatgaaaat 


gttgggaaat 


17580 


cctctgtgaa 


actcaatgtt 


caagggaagt 


17640 


ttagaaggca 


tgtgatgaaa 


taaaaaaaag 


17700 


atttgttcaa 


tttgactgac 


ttgattggct 


17760 


aacttaattt 


cattagttgg 


caaaaatggt 


17820 


gatcttgtaa 


tgataaagaa 


gagaagaatg 


17880 


tacraccracftfc 


CTQQCtat tQt 


cattaaacaq 


17940 


tttgcatatt 


aatatacata 


ggcaccagta 


18000 


tgttttctta 


agaataatac 


ctttctttgc 


18060 


taagcacaaa 


ggaaacccat 


atgggaaaaa 


18120 
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aaaacatttt gtgcttttaa tttagaatta 
aggatacaaa ataatatttt ttctagattc 

5 

cacctcaact ttacatgact cctacttcct 
tctaatcttt tattccagat atctctaagg 
10 atataatttt ctctgatact gtatctcagt 

attctctgca ttgtttctat tttacccacc 
ggtgtcagac attgagaaat gttgaaatca 

15 

cagagataca atgaactggt aagaaaaaca 
taatcttggt ggattaaaca catctatttc 
20 ggtctgggag aaagtcataa atccaaaaat 

ttttttaaat gttagaaacc agctagacaa 
tgaaaacctg actgacggaa ttggaagctt 

25 

cagtaagtct caagaattat aagtgctttc 
gtctgagaag gtagttgtat tagtatgttc 
30 tgggtaattt acaaagaaaa agaggtttaa 

cctcagaatc atggtggaag gcaaaatgca 
gagagccaag caaaagggaa aaccacttat 

35 

ttaccacgag aaaagtatgg gggaacacgc 
cctcccacaa catgtgggaa tgatggtagc 
40 ctgccaaacc atatcaatag tgaatactta 

aagatccctg aaggcctaga gtataaccag 
caggaaggga gagtgtatac agaaaaatac 

45 

atagacactg agatttacat ttatggtgga 
agaaaacagc atatgaaaca tgaggcagtt 
50 gaagcgtaat cataacactc ataatgaatt 

agttaactaa aaaatcgtga taaaactata 
gtgggaaggt gtgtccaggg taaaaatgaa 

55 

agtaggaggt tagtgcataa tatataaaac 
acttagaaat atggagtcaa cacaaaaaga 
60 agaaagaaaa ttgattggga actggggaag 
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tataggcact 


ggaaaaatat 


ttttattaaa 


18180 


aaaatgagga 


attacagtgc 


ttagatctat 


18240 


aatatatttg 


ttttcacatt 


ttccctctgt 


18300 


cttttaccaa 


tctctttagc 


tgagattgta 


18360 


tcactgctga 


ctgatgtgcc 


acctacatcc 


18420 


caatagcatt 


caacagaaat 


gttcctacct 


18480 


aaaagataaa 


taagatacat 


tatctgtcct 


18540 


cttataaaaa 


tataattttc 


aaataatggc 


18600 


ctcatgaaac 


ccaactaaat 


gactataaag 


18660 


tcaaaaataa 


tatgagagaa 


attggatcat 


18720 


gcaataaatg 


acttagcagc 


ctccggaaag 


18780 


agaaaaaatt 


tgattctctt 


tgtagaaccc 


18840 


agttgaatat 


gtctggggac 


acagctaatt 


18900 


tcacattgct 


aaaaaagaca 


taaccaagac 


18960 


tggactcaca 


gttccacatg 


gttggggagg 


19020 


catcttacat 


ggcggcagac 


aagagagatt 


19080 


aaaatcatca 


gatctggtga 


gacttattca 


19140 


ccccatgatt 


caattatctc 


ccagtggttc 


19200 


tacaattcaa 


gatgaaattt 


ggtggggaca 


19260 


agtaagtgaa 


ttccctgaga 


atagtgaatt 


19320 


tcctaactgg 


aataggagaa 


tataaggcgc 


19380 


taatgaaaca 


gatcaattac 


ctgtgggtct 


19440 


gtatttaaaa 


caacttattt 


tcaggtatgt 


19500 


actaacttca 


gagagaaaaa 


tggtcaaaag 


19560 


agccattaat 


gatttaaata 


ctataaattg 


19620 


tgggagaatg 


agggaaaagg 


aagtgttatg 


19680 


ggatataggg 


gttagtctat 


tgtcttcagt 


19740 


caaaaattag 


aacagtattt 


aaaatcattc 


19800 


aacaattaaa 


atttaaatac 


ttgcttctgg 


19860 


ggggcactaa 


tgatttcttt 


caggctaaat 


19920 
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ccagtgaaca 


ttttactgtt 


gttagcttgt 


ctctctgaag 


tatttaataa 


taattgaatg 


19980 


atttcttcct 


actgaaatat 


ttctgtgctc 


aaaaaatgat 


aagtaattta 


tttatattac 


20040 


ctaatataac 


cctttcaaca 


aegctatgae 


atattactat 


ctatatttta 


cagatgagta 


20100 


aactgaggtg 


cagaaagcct 


aagcaacttg 


ctccagctca 


catagggcat 


aggattcaaa 


20160 


atccagctca 


aggccgggtg 


cagtggctca 


cgcctgtaat 


cccagcactt 


tgggaggctg 


20220 


aggagggtat 


attacttgag 


atctggagtt 


cgagaacagc 


ctggccaaaa 


tggtgaaacc 


20280 


cagtctctgt 


taatatataa 


tataacatat 


tatatagaga 


tacattctat 


tatatatata 


20340 


atatattcta 


ttataatata 


atatatattc 


tattatataa 


tatatattat 


atattatatc 


20400 


tattatatat 


aatatatctg 


tatatgatat 


ataattatat 


ataatatata 


gatattctat 


20460 


tatatataat 


aggatatata 


ttctatatat 


atagaatata 


tatatttctc 


tgtcaaattt 


20520 


catgttattg 


ttctcttgaa 


attgttatct 


gctaataata 


acctgatact 


gtctcatttc 


20580 


ataccatatg 


gagtgggagg 


agagactgat 


gcctgtcctt 


agtccagaat 


ttcaacctct 


20640 


cttaggaagt 


catgaaactc 


tgtacccttg 


tettctgeaa 


tgccacattt 


tccgagtttg 


20700 


cctgctatct 


tccttaccct 


ctttttcata 


atcctctggg 


aaatcttatt 


tctgattata 


20760 


ctgtctctat 


cctctagaaa 


aagggtttga 


cgttttctgc 


ctccacccac 


aagcctgggt 


20820 


gacctcatcc 


ctctgggcga 


agagctgttg 


ggtttacctc 


tatagecatg 


ttctcattcc 


20880 


ctagctcctg 


ccagttatcc 


acttcttgtc 


aactttattt 


gaaggtccct 


acacaactct 


20940 


gatcaaatac 


gttcaaaatt 


ttactatttg 


ttcctgcagt 


tctgtcctgg 


tccatgtcta 


21000 


ttgccttact 


gaatggctct 


atcactcatt 


cagtttccta 


acccagaatc 


tgagtgtcat 


21060 


cctttcctgt 


tttttctcct 


gatttacaca 


tccattatac 


caacaagtac 


tgtcatttct 


21120 


tgttcctaag 


catctttcaa 


atccattcat 


tctctccaac 


ttcactagca 


tcacaggtag 


21180 


ttcagtaacc 


atcatccctt 


ccctagagaa 


ctgcctctac 


ettgagecta 


attctctgat 


21240 


tttattatta 


tttattcctg 


ttgtctgaaa 


ccaggacaat 


atttctgaaa 


tataatctat 


21300 


tttaattttg 


agaatcgtaa 


tttcagatta 


tagaccacat 


agtttttgga 


attaagtaaa 


21360 


actggatttc 


aatctcatct 


ataccagcta 


tacgaatatc 


tctgagtttc 


agtttcttct 


21420 


tctgaaaaat 


agaaataata 


atattacctt 


ccttcattcc 


ctttccattt 


gtgtgtcttt 


21480 


tgctcattca 


tcaggtctcc 


aattagatgt 


ccccttatct 


aggaagtctc 


ctctgcctac 


21540 


ccaatattqq 


at teat teat 


cctcctatga 


gctctcttag 


gatatcctac 


tttaccttgc 


21600 


agatcactga 


ttacaatggg 


ttgtaatgat 


gtatttactc 


atctgtattg 


ccactttacc 


21660 


ttgcagataa 


ctgatagtaa 


tggattgtag 


tgatttattt 


actggtctgt 


ttttccaagt 


21720 


agtctataaa 


ctctttgagt 


gaaaattatt 


ttttaccatt 


ttttcagaag 


cacacggcac 


21780 
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actgcttggc ttatagtagt ttttctaaag 
actttgcttt cctcctccat gtaaagtagg 

5 

ttcaggagat taacagggcc atgcaagaaa 
accagctcgt tctatgatac cagctgcttc 
10 atacaaccaa cttgttgcca ttttatggcc 

catgtaaata aaacatttaa gttgtgtata 
ttggccataa aataatggca aaaactgcaa 

15 

atagtagtgc tccaggaaga caagagtact 
atttacaaaa taccatttac ttgtttatat 
20 taaatcctct accactgata tccacagtaa 

tatcaggcag cccctgcttt ttcagcaaat 
aacacaaaat cagttaatgc ttccgatata 

25 

tctcagcaca gcctttaagg ttccaaacat 
agtgaggtgt ggagtggggg aagttgattg 
30 cccagagccc tgcatgcatc agacctacag 

ccagggactt gccttcctag gctttcgggc 
atttctggga ggggaatgag aagaataatg 

35 

cagggttggg gtgggttgag tgcaccttta 
gtttttgcta cagtcgatag gagttagaaa 
40 ataaggttgt ttagttactt ttgtcccctg 

acattttaaa cctttgacta gttacctcat 
tattctctgt gtctgactgg ttatttttat 

45 

tgcagaattt ggtgtacatg gagtagggca 
atgggctgtg gcctgaccta gttctccagt 
50 gcagtaaact gaggctgctc atgataaaat 

agcattctta ggataagaaa gaagcttgta 
gatgaagtaa aagaagccaa taaaagggaa 

55 

aaagggctgg gagaaaaaca aaaacaaaaa 
gcctttcccc cttttccctt cccatgccac 
60 ccatccagat tcaggcctca ttttgggttt 



PCT/US01/48437 



aaaatgctga 


atgaatgaaa 


actgcacacc 


21840 


cattatttta 


ttttacaaat 


gaatcttagg 


21900 


tgtttgccat 


tggtttgatc 


ttagtcaaag 


21960 


cttcacttag 


aaaggtgttg 


ttttattgcc 


22020 


actcctgatg 


aatctctaaa 


ggtggaattt 


22080 


ttaaggttgg 


tgcaaaagtt 


aattgtgatt 


22140 


ttacttttgc 


accaacctaa 


taaaatagta 


22200 


gtggaaatgt 


gtgaccctgt 


gaaagggggt 


22260 


aacgactttt 


gaagagagtc 


tggcatacag 


22320 


ttagcagtat 


tattttcata 


ggaaagactg 


22380 


ttcaaaatag 


acatccacac 


ccatattggg 


22440 


aaagaggaac 


tagctgtgcg 


tctcagaaat 


22500 


ctgctagaag 


aggaatgcag 


atttaaactg 


22560 


ggtctagacc 


aaagaacttt 


gaggaacttg 


22620 


cagacattgc 


aggcctgaag 


aaaggtaggt 


22680 


tgatggcttg 


tttcacttcc 


tgggttaaga 


22740 


aggccacttc 


ccatgaccca 


agctggtatt 


22800 


aatgtgcatt 


cggtgggggg 


atgaggaaca 


22860 


agcctcctaa 


aggctgacat 


tggaggctgg 


22920 


agtcttcagg 


atatcaaaac 


actgaaagca 


22980 


ctactgttaa 


tgcccttctt 


ctgtggtctt 


23040 


aactcaggat 


ttaagaacag 


atgtatcaca 


23100 


caggcagggt 


ccccagcctt 


gtacatttag 


23160 


ggtctttggc 


actattcatg 


gggcacccag 


23220 


ctgctaatga 


aaaacacctt 


gtagactttt 


23280 


agtggtcgtg 


gccattgtaa 


tctttcccca 


23340 


ataatgtacc 


cagaatcaca 


caagttaaca 


23400 


caaaaacaga 


attcttggtt 


cctagcccgg 


23460 


cacttctgca 


tccccatcat 


ccttctaaaa 


23520 


ctagtctatt 


tgacaacttt 


gccaagaaaa 


23580 
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acggatggct gtttgtacca ctctcctcat 
aggatggtga cataaaaatg aggaaataag 
5 actggggcca atagaagagg gaagggtggt 

tgggggaaag acattgtatt tcttaagagc 
aggtttaaat tcttctccaa cctggaagaa 

10 

cagttaaaaa aaaaattgaa tatagagaag 
tcttggggac ttcatgtgcc gtagataatt 
15 gaggcagtta tatgaattta tggccagagt 

tgaccagaac ctgacctatg ttaccaaaga 
gctttatttt gatataattg ctctgccctc 

20 

ctaaacacag agagagaacc accagtgcgt 
tctcagtctg aaggtgtgcc tgagagattg 
25 taaagtacac atttagatgg actaagaaga 

gacttttgca atgtagaagg aacataacac 
ctgactaatc cacatgtggg caatggtggg 

30 

agagactata aaatgctgtt catgtgtgag 
gttcttacaa ggtggtcaca agaggggtgg 
35 agatgtctcg atataggaat gaaattacgt 

cttggagctt taaaaattaa aaggagagag 
tgaagtagcc gagcttttgt tatcaacatt 

40 

aaatataaaa gaaggctttt aaaggggatg 
ctggctacaa catcagtggt ggattgaata 
45 tcagaggcta taatgaatac ttctgataaa 

ggagaaatac acagagttga aaaatggttg 
agagggtagg caatggaagg aaccaatgac 

50 

cactgtgttg cccaggctgg agtaccctga 
cagcctcctg agtagctggg attacaggcg 
55 tttgtgtaga gatggggttt caccatgttc 

gcggtcctcc caccttggcc tcctatactg 
agctttaaac ttccagattt ttaacatgga 

60 

agaaaggaca cttgtaattt catagccctc 
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gctctctcct 


tgcaattgtg 


acccatgagg 


23640 


agactggtgg 


tgctgctgat 


ccacaaggtg 


23700 


ctgggtttcc 


ccacctcttt 


atcaggggtg 


23760 


ctgatgcata 


taggtgatgt 


ggcaaaacca 


23820 


atgtccctca 


tatgcatcca 


atattctcag 


23880 


tcaagccctg 


gatggtaaac 


acgtttttgt 


23940 


aacacaagga 


tttctgtggt 


ggaaaaagag 


24000 


ttcttagaat 


gacaaagcac 


cagecttgaa 


24060 


attatgggaa 


agcataggag 


tcagttacaa 


24120 


gaggaagctt 


acagtcttac 


cagagaggca 


24180 


ttgataagac 


agacccaaca 


tgtcctaaat 


24240 


aggcagaggt 


tcttttacat 


taaagaagaa 


24300 


atcatcttta 


gtcactttcc 


aaataacttt 


24360 


atttcttaaa 


cccttattta 


cegtcatagg 


24420 


gaagtggggg 


aatagtgtct 


gagagaagat 


24480 


aagattagat 


gatgtggtgg 


gtttgttttt 


24540 


aacattcctg 


caaatggttt 


caatatatgc 


24600 


ctttggaaca 


acttaaataa 


gtcaaatata 


24660 


attcgaggta 


agactacaat 


agatttgtgc 


24720 


aaaaatttta 


actttatgct 


aagggtggtg 


24780 


gtgactcaat 


tgtttttgta 


tttgatcatt 


24840 


tccaaagcca 


caaatggaca 


cagggaacat 


24900 


aatagaataa 


tggcttgcct 


tgagaagatg 


24960 


tgtaggatta 


gaatgagatt 


gattgagcat 


25020 


aaactatcag 


atatttttga 


gacagggtct 


25080 


tgccattgtg 


gcttactgca 


gcctgggcct 


25140 


cccaccacca 


cacccagcta 


atttttgeat 


25200 




l_ i_. U del O.L. 


L- y y y v u^i-d. 




ctgggattac 


aggattgagt 


caaaatgccc 


25320 


tagtgtagtt 


aattgctgga 


ccttccacca 


25380 


ttagggttcc 


tgttcccttt 


agctcttgcc 


25440 
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actcttgatt 
ccaccttact 
tctaaggttt 
atttatgggc 
tgtaacaaaa 
tagccatatg 
taggtctgaa 
tcattggaac 
ttgttatcta 
tgctaagaaa 
tagtttagtg 
ttggccctgc 
gaataataga 
aaaccagtgc 
ctactatcac 
ctgccactca 
tctccggtct 
atgtttttca 
caaagattgg 
aagactatgc 
gataatggag 
tatatcgaca 
taaaataaaa 
ttaaacatca 
taaattaaca 
tagaagacct 
gtgttacagg 
agtggcaaga 
tcttggtgga 
tttgggcatg 



agcaactact 
tttttgaaag 
gaaataaaaa 
tttattagaa 
tttgaggatg 
ctccaaatct 
tatacctact 
agagactgca 
acttattcca 
attaattgta 
gatcccctta 
cactgtttac 
atatatctct 
gtgtcactta 
ttactcattt 
aggacatgct 
tggggaaaat 
tggtaccatg 
tcagaagagt 
agaggcgggc 
tttatgaggt 
ctttagcatc 
tagcaggcct 
gagaactgtc 
atttaccaac 
aaatagaaga 
aaaagtatgt 
actcaacgta 
aggcccagca 
gtggctcact 



tacagggctc 
ggaaattttg 
taacctcttc 
tgactttaaa 
tggattaaat 
aaattttgtg 
ccatcgcata 
agaggaagac 
aattgggtgg 
caagtttttg 
agcaaaggag 
tcaagttact 
caggggaatt 
ataaatgcta 
ctggcttgaa 
agacagggac 
agaaaagcag 
tggtcagagg 
caggattgtc 
ctagaggctc 
gggttacatt 
ctctgagcag 
tattttttgt 
cccctctggg 
tgccccagaa 
tattcggcct 
ggaaattatt 
acagaaaccc 
attaactaca 
cttgtaatcc 



tgaaataatc 
aggaagactg 
tcccttctgg 
ttaagcccag 
gtaagaccaa 
ttctgtctag 
taaatgcaaa 
aagggcctct 
tttctacctt 
tgcaaacaag 
ttctggaatt 
taatcttgtg 
taatgatgaa 
aatacatatt 
gtataaaaca 
tttactcctg 
ataagagggg 
ttgactcttc 
ctcctttttt 
tcctcttcct 
cacatgctat 
aacacaaaat 
cttaaattta 
cttaatttaa 
gggagaagag 
tcctcaaaat 
tcagaaccct 
taagagggct 
ttgtctgtgt 
cagcactact 



tccacatagg 
gatgtcttct 
gacaagtagg 
agaaggacaa 
aaataagacc 
gaagtgctaa 
ccacctcaca 
tggcccacct 
ttagctatta 
agatcttcca 
acatcacctg 
tttctataat 
ataacacata 
gactattatt 
taaacttcct 
tcatgaccct 
tctatataat 
ctctcttttg 
gctctcagta 
cttgcagtct 
ggagtaggta 
ttagatatgg 
atttgtactc 
agtcattcta 
gtaattttta 
ttctctttca 
acaagcagcc 
atgggactgc 
taaaaatctg 
ggatgctgag 



aagatttgcg 
atttaggtct 
taaattgtca 
ttcccttaag 
tattttattt 
agtgtatata 
aattccatta 
ctacgtagtc 
tagataatgc 
ttcaccagac 
gatttaaatt 
tctgtcctgt 
taaaacattt 
actggtattc 
ttaacacttc 
gctccaacct 
tggtgagagc 
cagtttgtct 
agttagatac 
ctgttctaat 
cattcagacc 
atcaaagggg 
gcaaattttg 
ataaagccca 
tttcaaacat 
aaaaagtaag 
tctaatcaag 
aagtgtcctt 
taagtttaag 
acgggaagat 



25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
26820 
26880 
26940 
27000 
27060 
27120 
27180 
27240 
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cacttgagcc 


ctggagtttg 


agaccagctt 




aaaatacaaa 


aattagctgg 


gtgtggtggc 


5 


ctgaggtcga 


ggctgcagtg 


agtcacgata 




atgagaccct 


gtctcaaaaa 


agtctggtag 


10 


aacataggtt 


cttttctatt 


ttccctaagt 




gtacaaaaga 


gagaaatttt 


acagctaggc 




accgtgatgc 


ccccgagctg 


caaaaccagc 


15 


gaatgtacaa 


atagggagtg 


ggtcacagag 




acaagggcag 


aagggcagag 


caaggtcaca 


20 


ggttccatgt 


cctgctgtgc 


acacattgtc 




tcaagagcag 


agaaccagtc 


tgactacaat 




agcctgaggg 


tgctgcagga 


gaccagggtg 


25 


gcagacaccc 


ccagagcggc 


tgtccatagg 




gttattcctt 


actgggaaaa 


gaattcagtg 


30 


aaaaatatga 


ctttgttctg 


cccggccccg 




tgtttcctga 


acatcgctgt 


tatcctgttc 




caaacacaca 


tgttttacaa 


tttgtgcagt 


35 


cgtacatctt 


cagcttacga 


agatgacggg 




gaaattataa 


aagtattgat 


tggggaagtg 


40 


attcagagat 


tgcagtaaag 


acaggcataa 




aatgaatgtc 


catgaaatct 


tcacaattta 




tccgttcggg 


gtccctgact 


tcccgcaaca 


45 


accctctatg 


ctcagtcaat 


cattaaatga 




gacttgaagt 


ttattttaca 


tgatctaagt 


50 


tgcatggaac 


agtccatccc 


ttaactttca 




ctcttacagc 


atctggttag 


atcttgtttt 




tgttcaaaga 


tttaattcat 


ttatatttaa 


55 


taacatttta 

W W*- \*r lv LI 


t tat t ttaacr 


cracrQt ttcit t 




tggtctacct 


ttgtgattta 


gtaattttat 


60 


tgtgtatctg 


ctgtagttct 


ttgccttgta 




gttttataaa 


ttgactattt 


taaggtgaca 
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gggtaacata 


gagaaacccc 


atctctacaa 


27300 


acaggcctgt 


aatcccagct 


acttgggtgg 


27360 


gtgccagtgt 


actccagcct 


gggcaacaga 


27420 


tttattggag 


gaaccagtcc 


ccaatatttc 


27480 


gtcagccagt 


ctgagaaata 


aagagaatga 


27540 


ctccgggggt 


gacatcacat 


atcagcaggc 


27600 


aagtttttat 


tagggatttc 


aaaagacgag 


27660 


atcacatgct 


tcaaatggca 


ataaaagatc 


27720 


aggccagggt 


gaaattagaa 


ttactaatga 


27780 


attgataaac 


atcttaacag 


gaaacagggt 


27840 


tcaccaggct 


ggaatttcct 


aatcctagca 


27900 


tatttcatcc 


cttatcttca 


atcgcataag 


27960 


caccccctgg 


gaatgcattc 


tcttcccagg 


28020 


atatttctcc 


tacttgtttt 


ctgcaataag 


28080 


caggcagtca 


gaccttatgg 


ttatctccct 


28140 


tttttcaggg 


tgcccagatt 


tcatattgtt 


28200 


taacgcaagc 


atcacagggt 


cgtgaggcaa 


28260 


attaagaaat 


taaagtaaag 


acaggcatag 


28320 


ataaatgtcc 


atgaaatctt 


cacaatttat 


28380 


gatattataa 


aagtattaat 


ttggggaact 


28440 


tgttcttcag 


ccatggcttc 


agccggtccc 


28500 


gtagtttgtt 


attaatgact 


tctattaccc 


28560 


tgaccttctt 


tgcctctttt 


gacagttttt 


28620 


acaggtaatt 


attctcttta 


ggctagtctt 


28680 


actatgtttc 


ttcttaaggc 


taaagtgagt 


28740 


ttaaaaattt 


tccagccgct 


ctttgacttt 


28800 


ggttagcatt 


aatggataag 


gatttactag 


28860 


tttataaata 


t ttatttctt 


tcttatactc 


2 8 920 


gaattgctaa 


gctttgaatt 


ctttaacgtt 


28980 


ctcatcatta 


ggcttacata 


aaaccacgta 


29040 


ataacaatgt 


ttgttgaaac 


aagaactcta 


29100 
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gacttgtact gtcctcccac aaaactaaat 
aaaaccattg aagactgggt aatttattat 

5 

ctgggaagtc taaaggcatg catggtgtaa 
agtggaagtg agcgtacaaa gcagagagag 
10 agtccactcc tgtgataact aaccttctcc 

gtagaggtct cctgatgtaa ttacctcttg 
gcaattaagt tttcaaatga gttttagagg 

15 

accctccaaa actcatgttc ttctcacata 
caaaattttt acccattcca gcattgactc 
20 gcagatatgg atgagactca aggtatgatt 

cctgtgaaat caaaacaagt gatttgtctt 
agactttctc agttcaaaag aaataactag 

25 

agtccaaaac ccaaccaaga aaacagcatt 
nnnnnnnnnn nnnnnnnnnn caccttttct 
30 catgcaagga atggaacagg ccatgccagg 

catggctgtc atacattcac atctgtggaa 
acccaaagtc cttggggtaa gttgcaaatc 

35 

tatttcataa gacctaatgc caaatacact 
ccctcacgac gtcagtaatt acttttcagg 
40 gtcacaacat gcttccagtt gagttatatt 

ggttgtgggt catccatgat aattgtattt 
aggccaccta tgcactacta gctgacgccc 

45 

cacatattgc tactatttct ttctggcacc 
aatctatcct ctgagcattt ccatctggag 
50 ggattaaaag aatgacagaa tagggcaggt 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
cttcaaacca cggngaaaat gtcactgtgg 

55 

ttttctggag ggggggaaat atagttattt 
tactaggttt gtttgtttat ttatttttaa 
60 tcatccaggc tggagtgcag tggcacaatc 



ttttgatatc tttgttttct gctgctgtaa 29160 

aaaattgtat tttgcttata gctctagagg 29220 

acctctagtg aggataatcc catgaaggaa 29280 

agatggggcc aaatttattc ttttttcagg 29340 

aaggataatg acattcatcc attcatgcag 29400 

aaggtcccac ctcttaatac tgttaaaatg 29460 

ggacattgaa accatagcat ctcatctaca 29520 

caaaatacaa tacttctgtc tcaatagcct 29580 

taaagttcta agtccagagt cacatcagaa 29640 

tatctggagg caaatgtttc cagctgtgag 29700 

caaaatataa tggtgaaaca ggtgcagaat 29760 

gcaagaaaaa ggtgtaagtg gtttcaagtg 2982 0 

aagctttann nnnnnnnnnn nnnnnnnnnn 2 98 80 

gctgcccacc ttttctgctg ccatgacaac 29940 

ggctggccct ggtgtgcccc agctgggaaa 30000 

aggattgcaa gagaagttct tgaagggaga 30060 

taggggaaga ccaatggtgt tgcaaactca 3 0120 

gtcccactag cctcataata caacagccaa 30180 

ctcttcatct gaaaaactat atagtattct 30240 

taaaatatat tgagttcaga tctattctag 30300 

aaactcagtg tggaaaagca gcaaaataaa 3 03 60 

cctttacagg ccttcctgat agctgacagg 30420 

ttttcttagc cccctgctca gaggatgaaa 3 0480 

tctcagatga gcatgcccaa aagaaatgaa 30540 

gaaaggatat nnnnnnnnnn nnnnnnnnnn 3 0600 

nnnntaaatt aggtttatca ctagcccaag 3 0660 

atattctgga tccttttttc acattattca 30720 

ctcataaaat atgttattta tgttggcaca 30780 

ttttattttt tgagatggag tctcgctcca 30840 

tgggctcact gcaacctccg tcttccgggt 30900 



-133- 



WO 02/062946 PCT/US01/48437 



tcaagagatt 


ctcctgcctc 


agcctcctga 


gtagttggga 


ttacaggcat 


gagccatcac 


30960 


acctggctaa 


tttttgtatt 


tttagtagag 


acggggtttc 


agcatgttgg 


tcaggctggt 


31020 


cttgaactcc 


tgacctcgtg 


atctgcctgc 


ttcagcctcc 


caaagtgggt 


ttatttttta 


31080 


atgtatgaat 


aattatttta 


aaatatactg 


tatgttttag 


agcagtttta 


ggcttaaaga 


31140 


aaaattgaga 


gcaaagtaca 


gatctatccc 


atatgcccct 


tgcccccaca 


aatgcatagc 


31200 


tttttaatta 


tcaacctcct 


ccaccagcat 


gtaacctttg 


taccaattga 


tgaactacat 


31260 


tgacacatca 


ttactaaaag 


tacacaaatc 


acaatggggt 


tcattctctg 


tgttgtgtgt 


31320 


tctgtgggtt 


ttaacacaag 


tatagtgata 


tgaatctacc 


tttgtagtat 


cacatagtat 


31380 


cccaccaatc 


cctggcaacc 


acaaatcttt 


tattgtcttc 


atagtttgcc 


ttttccagaa 


31440 


agtcatacag 


ttggaattgt 


acaataggtg 


ttttttttag 


attgcgttct 


ttcacctaaa 


31500 


tacatgcatt 


taaatttcct 


ccatgttttc 


ttatggcttg 


atagctcatt 


tatttttagt 


31560 


actgaataat 


atttcattgt 


ctagaagcac 


cacagtttat 


ttatgcattc 


acctactgaa 


31620 


ggacatcttg 


atggcttccc 


aactttggca 


attaggagta 


acattgctat 


aaatatctgt 


31680 


gtgcaggtca 


tgtgtggaca 


taagtttttc 


aactcatttg 


gttaagtatg 


attagtgtat 


31740 


tgtaagtaag 


agtacgttta 


gttttgtaag 


aaactgccat 


actgtcttcc 


acatggcctt 


31800 


gtcactttga 


agatttcaat 


caacagtcaa 


tgagttcctg 


ctgttctgca 


aaatctccag 


31860 


caattggtag 


tgccagtgtt 


ctggattttt 


accattctaa 


caggtgctag 


tggaatctca 


31920 


ctatcatgtt 


aatttgcatt 


tcccttgtga 


cataagatgt 


ggaccatctt 


ttcatgcgtt 


31980 


tctttaccat 


ctctatatct 


tctttcgtaa 


ggtgtcttgt 


taaattcttt 


ggcacatttt 


32040 


taatgaggac 


gtttatgttc 


ttattgttaa 


gttttaaatt 


tgttgtatat 


tttgggtaac 


32100 


gatcctttgt 


cagatatgac 


tcttgcaaat 


atcttctccc 


agtctgttgt 


ttgtcttttc 


32160 


atctcttgac 


aatgtcttct 


gcagagcaga 


actttttaat 


tttaagaaaa 


ttaagatgat 


32220 


ttattctgta 


tttcatggtt 


tgtgcctttg 


gtatcgtatt 


gacaaattca 


tcaccaaacc 


32280 


caaggtcatc 


tagattttgg 


cctatgttat 


ctgttaggag 


ttttatagtt 


tttagtttaa 


32340 


tatttatgtc 


tgtgattcat 


tttcagttaa 


tttttgtgaa 


agatataagg 


tatgtgtctt 


32400 


ttattcttac 


ttatggatat 


caagttttaa 


gtactatttt 


ttaaaatact 


ttttttttca 


32460 


ttgtattact 


ttagctcatt 


accaaagatc 


acttgactat 


atttgtgtgg 


ctctctccag 


32520 


actttctact 


ctattccatt 


at c tat t tot 


ctatttcttt 


crc t aat acca 


cacaatcctq 


32580 


attactatca 


ctttatagta 


tgtcctaaaa 


ttgggtagtg 


ttagtcctac 


aactactact 


32640 


tcttcttctt 


cttcttcttc 


ttcttcttct 


tcttcttctt 


cttcttcttc 


ttcttctaga 


32700 


ttgtgtttgc 


tattctgagt 


tttttgccat 


tccttataaa 


ttttagagtc 


agtttatcaa 


32760 
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tatatacaaa ataacttgct gggattttaa 
tggaaagaac tgacatcttg acaatgttga 

5 

tccatttatt ttatcaaagt ttgtagtttc 
tatacctaac aaatgccaaa taccaatacc 
10 aaatggcatt atgtttttaa ttccaaattt 

aattgtctct tgtgtattaa tcttgtataa 
gttccaggaa tttttggttg attctttcag 

15 

aacaaaggca gttttatttt tccctgacca 
ttactgtttc tgctatgact tccagtacaa 
20 ctgccttgtt cctgatctta gtgagaaagc 

ctgtaggttt ttcacaaata ttctttatca 
ttttcaagag tttttataat aaacaggcat 

25 

gttaatataa tcatgtgatt tttcttcttt 
gttttttgaa tgtatagtca gtcttgcata 
30 acaatttttt tgatgcattg ttagatttag 

tatttgttca tcagagatgt tggtctgtag 
ttttggtaga tggtagtgct agcttcagag 

35 

attttctgta aataatggta taattttttt 
tccttctagg tctgatgctt tctgttttgg 
40 aataattata ggcttattca gagtgtctct 

aggagttatc cattttatca aggctatcaa 
ttttattatc cttttaattt ctatgggaac 

45 

agtaattttt gtcctctctc tttttcttag 
tacatacata tatttttaaa aactgctttg 
50 gttttcaatt ttatgatttc tgctctaatt 

agtttaattt gcacttcttt ttctagtttc 
gatcatgatt ttctaatata ttcattcaat 

55 

actacatccc ataaattttg ataagttgtt 
ggataatggt actcttactc tgctacgttt 
60 ataccccaga cttcaagaaa atatcatatt 
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tggagattgt 


attaaatcta 


ctgatcaagt 


32820 


gtcttcctat 


ccatggacat 


ggaatatctc 


32880 


ttatagattt 


tgtacatttt 


tttttagatt 


32940 


taaatatttt 


atttttagag 


gtgctaattt 


33000 


cacttattca 


ttataagtac 


ataagaaagt 


33060 


atcttgtaaa 


cctaatataa 


ttgtttgtta 


33120 


attttctaca 


cagacatcat 


gtcgtctggg 


33180 


gtctgtatat 


ctttatttcc 


ttttcttgtc 


33240 


tgttgaaagg 


agtagtgaca 


gggctcatcc 


33300 


tttgagtttc 


tccattaagt 


atgaagttag 


33360 


acctgaagaa 


gttctcctct 


attaataggt 


33420 


tggattttgt 


tcaatatttt 


ttctgcgtct 


33480 


agcctgttga 


tgtgatgaat 


acattaatcg 


33540 


cctgggataa 


atcccaagtg 


gtcatggtac 


33600 


tttgcagata 


ttttgttgag 


gatttttgca 


33660 


ttttcttttc 


ttgtaatgtt 


tttgaatgat 


33720 


aattagttaa 


ggagtatgtc 


ctctgtttct 


33780 


cttaagtgtc 


tggtagaatt 


caccactgag 


33840 


aaatttatta 


attattgatt 


caatttattt 


33900 


tcttgtgtga 


gttttgacaa 


attgtcttta 


33960 


actgtgggcg 


tagaattgtt 


cataacatta 


34020 


tgtattgatg 


ttccttcttt 


catttatgtt 


34080 


gctggctaga 


ggctttttaa 


attttattta 


34140 


gattttattg 


attttgttta 


ttcatgttct 


34200 


tctattatag 


ctcttcttct 


gtttatttgg 


34260 


ttaaggtaga 


ctctcaggag 


attgatttta 


34320 


gctatatatc 


ttcctctaag 


cattgctttc 


34380 


gaaatgtttt 


tgatcctttg 


gaatgccatg 


34440 


agatccctgt 


gtactcatac 


ttgtgtcctc 


34500 


tgcttttctt 


gttttcttcc 


cttgtatata 


34560 
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ctctcctttc 


ttcaggttac 


ttctccttgc 


tacaataatt 


gctttataat 


tcctaggcca 


cttctctgag 


ctcatgagaa 


aaaaagagaa 


ataattgtgg 


tagaagtaac 


atcgtaggta 


tcagaaaagc 


acctctgggc 


tggttctggc 


atactccttt 


taaccttgaa 


attattggca 


atgatagacg 


tcgcgagaga 


taatattggt 


aagattacat 


tcacggttat 


ttatgtgaaa 


aaagagtgat 


aagagaatac 


taagaaggtc 


atatgactat 


caaatgaagg 


agaataagga 


tagcttagtg 


attggaaagg 


gggtggcgga 


agcaccattt 


ttgtggtgga 


caatcagttt 


ttgctttctt 


caccattttt 


aggttgtgca 


gattctgact 


gccctgatga 


gccttagcat 


gggaataaca 


atgatgtgta 


tggcatctaa 


tacttatgga 


agtaacccta 


tttccgtgta 


tatcgggtac 


acaatttggg 


ggtcagtaat 


ggtgagtaga 


gtatcttttg 


atataattga 


agataacata 


aagcagttgt 


caataccctg 


atctttggaa 


tcctattcta 


accgtatctt 


gtaattctgt 


aaatctgaga 


gtggtatcta 


accatttctt 


actcaatttt 


ctcatattta 


aagtactaca 


aataatatag 


tattatatta 


aatctaggca 


gattaaatat 


ttatatatgt 


cagttagata 


tattacagca 


gcacattgcc 


aaagacccaa 


accagaatgg 


cttaaacatt 


ggaagcactg 


attatttcac 


aacaagacta 


gtggcttatc 


atttctagga 


tggcttcatt 


cagtggtgca 


aggatttcat 


caaggattgc 


tagcagacca 


catcagaggg 


gagatggttg 


acacatccaa 


cttttggtaa 


atatttcata 


tacatattcc 


caaatcaact 


gtcagtaagg 


ataacggact 


tgccattatt 


ggcttggact 


aatcaagatt 


tacctctgta 


gctgggagga 


gtctacccta 


cctttagctc 


atgagaaact 


gaacaaaatg 


atggtttagt 


tagcaaggaa 


gaagaggaga 


tgtggctggt 


acacaagcag 


tttctaatat 


ctagtctact 


gactatcgaa 


actcaggaaa 


ttgttacctc 


agaaaatgaa 


aatgttattt 


gatatatttt 


gtttcataaa 


ttcaataaaa 


taactaatta 


ttataattaa 


tattaaatta 


attataacta 


ttattaatat 


tacattaaaa 


ttagaattgg 


aatttaaatt 


taaaaacctt 


ctcccaagag 


aaaacacaag 


atagttttaa 


tgcaaattga 


gagtaaaatt 


gagaataaaa 


taccaagttt 


tatccattga 


agagagaaaa 


ataaaattta 


taagttgggt 


ttatgttaga 


atcagggaat 


ctattctagc 


attgcagggt 


ttccatttat 


ttctaaaact 


gagaatgagg 


tagtgtagag 


ttttatagaa 


gagcttgctt 


tttgatcccc 


ttcacctgtt 


tacagcagaa 


gtgcaagatt 


ggaagaacag 


taaagctgtg 


ttttacttct 


agaaagctgc 


acaatacttc 


atatatccaa 


aaatactgat 


accaat aaca 


actcttatta 


aac t c t t act 


atcatcrtQQa 


cactc tact t 


aqtQC t ttat 


atgtgcaatt 


tcatttcaaa 


caatcaccac 


gaacttctga 


tttttcatta 


taatcatttt 


atggatgaag 


agaggttaag 


taactaaccc 


aacatcacaa 


ctgcttggta 


caggaatgaa 


gattcaaata 


gggtcaatgt 


attttcagaa 


ttgcatacat 


ctaactactt 


agttttctgt 



34620 
34680 
34740 
34800 
34860 
34920 
34980 
35040 
35100 
35160 
35220 
35280 
35340 
35400 
35460 
35520 
35580 
35640 
35700 
35760 
35820 
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ttttatttcc gtttcctttt caagatcaca 
ccacacatgt tcagtagatt accaggaact 

5 

tgagactttc gtaggtgttc cattagatga 
cggctacctt ccattattct ttcaaccttt 
10 tttttctctt tcatacagta taagtaccaa 

ttttggcctc ttcagttctg tttctgtata 
atagtacata catacttttt cctgtggcaa 

15 

aatacacaga catacataca cacttgtaat 
ttccaaggtg ccttattgaa tttttagaat 
20 gaatgtttgc aatatatatt gttcatactt 

ctatttttct tatgtaatta ttttggtaaa 
gcatctttat cttgattttg actttaatga 

25 

ataaagactt ttagactgag acagatttct 
tttttttttt tttggttttt ggtttttggt 
30 caggctggag tgcaatggca cgatctcggc 

ccattctcct gcctcagcct cccaagtagc 
gctaattttt ttgtattttt agtacagacg 

35 

tcgtgatcca cccgcctcag cctcccaaag 
cctgccaata attctttata ttttaccagt 
40 tattaaatgt cctagcatat caacactgta 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnntgt ttgtagagct 

45 

tttatatcat atgatgggtg attccctcta 
catgaatcat ctctatgtct gggtatttgg 
50 tagttctgta ttttttatat tatgatataa 

agcaaattaa aaactccaac aattgaatat 
agttatttta gctaaatgaa aaataattgc 

55 

tctagtcatc atctaattgg gaatttacat 
aaaatattgg ccgggcgcag tggctcacgc 
60 cgggcggatc acgaggtcag gagatcaaga 



ttaaaaacca 


gcttacatgg 


tgcaagcctt 


36480 


gcaggagcca 


tttgtccgct 


gagacatcac 


36540 


gcctttactt 


aattggtgaa 


ctcttttctg 


36600 


ttttccaaac 


ctcttctatg 


aagtgcctcc 


36660 


taccaagaac 


cagtcacaag 


ttcctcttat 


36720 


tgtttcacat 


tccttgtttt 


ttcctctgag 


36780 


atgaagtcct 


cttattcaat 


cccatcttca 


36840 


gtgtatatat 


aatttctgta 


tagaatgttt 


36900 


agtttatggt 


aatttttagt 


tcacgttctt 


36960 


aaataagaat 


agttaaacca 


cttgttccaa 


37020 


tatacccaga 


atagtattac 


ataatagtca 


37080 


aatgatccta 


catttttatc 


attaaacata 


37140 


atcatgttac 


agaaattaca 


attctttttt 


37200 


tttttgagac 


agagtttcgc 


tcttattacc 


37260 


tcactgcaac 


ctccgcctcc 


cgggttctca 


37320 


tggaactaca 


ggcacccgcc 


accacacccg 


37380 


gggtttcact 


gtggtctcga 


tctcctggtc 


37440 


tgctgggatt 


acaagcgtgt 


gccaccacgc 


37500 


aatgttgcag 


ttactaataa 


tgttgaattg 


37560 


aacatgaata 


gcaataagcc 


tgtnnnnnnn 


37620 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


37680 


gaaggaacat 


cccaaatgtt 


catgatgtat 


37740 


gtggttttat 


tccagtggtg 


atttagatat 


37800 


aagttaaata 


aatttattga 


gcttttttat 


37860 


tctttgaatc 


atctgaaatc 


aaatttgaaa 


37920 


taaaagatag 


agagtctaac 


caacttgact 


37980 


ctattgatgg 


aatcaaatga 


actactatat 


38040 


ttttttgcta 


ctattttgct 


tataagttta 


38100 


ctgtaatccc 


agcactttgg 


gaggccgagg 


38160 


ccatcctggc 


taacacggtg 


aaaccccgtc 


38220 



V 
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tctactaaaa acacaaaaaa ttagcccagc 
tcgggaggct gaggcaggag aatggcgtga 
5 gattgcccca ctgcactcca gcctgggcga 

aaaaaaaaaa aannnaattc taaatgagaa 
gtcatcaatt ttggtaccat gtaagaatta 

10 

tagtgccctc aaacagtttt aatatcattt 
ctcccatgga tttatctggt attatttctc 
15 gcaaaccttt gtggttttct ttaagataat 

aattaagttt agtaagttat gactcccaat 
tctctcattt ttagtttatt atttcaggat 

20 

caaaaggcct ggtgagtaat attttctttt 
aaaatgtatt ctgccagggt gtaaaacagg 
25 gcgattgtgc attttaccta ctagaatctg 

tgacatagga ctggatattt tggcaaggta 
ttgaagatat agagaattat tgagaagaaa 

30 

gcagcatgag tcagcccatg atagtgacac 
cccattgatt ataatgtaga ctccaactgt 
35 agaaattcac aataaggtta ttacataaga 

gatgcctttc ataaagccaa tacaatgcga 
agttaaagtc catatttgag ttttctttgt 

40 

gatcaatatg ggttttccaa atctggccta 
agccacatta ctcatactcc tcacactttg 
45 ttttttctgg tttttggttt ttggtttttt 

tggagtgcaa tggcgtgatc tcggctcacc 
ctcctgcctt agcctcccga gtagctggga 

50 

tttttgtatt tttagtagag atagggtttc 
tgacctcaga ttatccaccc cgcttcggcc 
55 cactgcgccc agtcagcaat tttagttctt 

aagagaacgt acagaaaaga atatactgct 
catgacaggc aaacagcatt tagaaaaatg 

60 

ggcagggaag gacacagaca gggctgtcca 
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gtagtggcgg 


gctcctgtag 


tcccagctac 


38280 


acccgggagg 


cggagcttgc 


agtgagccga 


38340 


cagagcgaga 


ctccgtctca 


aaaaaaaaaa 


38400 


taattttagt 


gtattttttg 


tttaaatata 


38460 


ctttgtatca 


ggagtttgga 


aactatcttt 


38520 


cagttagata 


tttgaaaaac 


ttgataaatt 


38580 


tgttctttct 


gggaaaaggg 


aaaataatgt 


38640 


tggtctgatt 


aacatatcta 


tctaatttag 


38700 


actggcttaa 


agttttttac 


cttcattttt 


38760 


ccttgtcaat 


tgcagcagga 


attagaacta 


38820 


tttggtatca 


aaaaaaggaa 


ggattaataa 


38880 


ttttgtttct 


ttgtttgtaa 


aaaatcagga 


38940 


tctttcaaaa 


actgcagtca 


ggctaattgc 


39000 


gggaatctta 


cgcttgtgga 


gggaatcttt 


39060 


tatcaagtgg 


gaccgctcca 


acaaaaccct 


39120 


caagcattgg 


tgttgtgtaa 


tatttactga 


39180 


gtccagaaag 


gattttgagt 


tttcaataaa 


39240 


agcaaattta 


aaatgtaatt 


ctcatgccaa 


39300 


aagtccatat 


ctgaataaat 


gtttttttct 


39360 


tcaaaagata 


aatatcaaat 


tctggagcat 


39420 


acccaactcc 


ctagtgtccc 


ctccggctgc 


39480 


ctgtgcaagt 


ttagttcttt 


tttgtttgtt 


39540 


gagacagagt 


ttcactctta 


ttacccaggc 


39600 


gcaacctccg 


cttctcaggt 


tcaagtgatt 


39660 


tcacaggcat 


gtgccaccac 


gcccagctaa 


39720 


tccatgttgg 


tcagactggt 


ctcaaactcc 


39780 


tcccaaaatg 


ctgggattac 


aggcatgagc 


39840 


attaaaaacc 


tec ttacctt 


tatccatQaa 


39900 


gagagagata 


atatacaaaa 


ggtatagagg 


39960 


ataagaaatc 


aggcatcaga 


agcaaggagt 


40020 


gctaaagtgt 


gaaggacatt 


gaaaaccctg 


40080 
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ctaaagagat ttggagatta tattatgtgc 
cggaataatg agatcatgtt tgtgttttgc 

5 

gttagaagac tattgaaata atcaagataa 
gtgtaaagtg attcatggag atatatctga 
10 attcctttct aggtccgagg tagtctagga 

tcagggatct taatcaacac atttagcttg 
aactactatg gcaactcaaa taattgtcat 

15 

gcctcttttc aggggatatt attatagaag 
acaataatta atattcccta attgatgaat 
20 agaataatct ttcatttagg agttaggtac 

tatcatgagg caaaatggga agaaagctta 
actgtgctgg gtattttgta tacatttttc 

25 

tactgttatc ccctctaaaa atggggccct 
taaaattgtc ttgaacagta ttttacaagt 
30 tttggatgtt cttgcaaggt taaccacttt 

cagtcttcat tatgactttt actacttttt 
ctaattgctt tattctgaaa atgccaccta 

35 

gatattttgg gccattgggt aattgtgggg 
actaaaatgc acaattaaat ctttattatt 
40 cctgtaatcc cagcactttg ggaagatgag 

agaccagcct ggccaacatg atgaagcccc 
gcatgatggc acgtgcctgt gatcccagct 

45 

gaagctggga ggcggaggtt gcagtgagcc 
aacagagcga gactgcatct caaaaaaaag 
50 caaatttttt gttactttcc ctctatatga 

ctcttcctta ccacgttctt cgcttatgta 
actaagcaga atcagttgca ttctgtttac 

55 

tgggctatat aatcacacca aaataacccc 
cccttgagat ttctattgct ttttatagta 
60 ataagaaaaa gcttctatag agaatttgac 
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ccacgggaat 


catctgaaga 


attgaagctg 


40140 


caagagatta 


gagttgaaac 


cacaagacaa 


40200 


tgatagatgg 


cagtaagaaa 


gagatggtgg 


40260 


ggatgttgtt 


ggattgttta 


attgattttc 


40320 


atgaatatca 


ccagctctgt 


actggctgca 


40380 


gcgttttatt 


cattccatca 


cccttactgt 


40440 


gggactatgt 


ccatcttaat 


ggttggtact 


40500 


caagtttggg 


gagtgctggc 


tctggcaaag 


40560 


gactttggga 


agttggagtg 


attgagaaat 


40620 


atgtctcatc 


tctgtcttta 


ttcttctata 


40680 


tagttataga 


atacctactc 


attaatagtt 


40740 


tgtttaattt 


taaaacattg 


atgcagtacg 


40800 


gatgtcagtc 


cctattctct 


aataatgaag 


40860 


ggtgagtagc 


tcagtatcct 


ttccaacatc 


40920 


tatcctcaac 


atctagctcc 


tgataaatct 


4098.0 


tgtaccaatt 


gttcttatct 


taatgctatc 


41040 


cacaatattt 


tgtaggtagt 


aaaacttctt 


41100 


aggaaaagtt 


attgcagcac 


atcacagcag 


41160 


aaaaagcctg 


gcagggcgca 


gtggctcacg 


41220 


gtgggtgggt 


cacctgaggt 


caggagttca 


41280 


gtctctacta 


aaaatacaaa 


aattagccag 


41340 


actcgggggg 


ctgaggcagg 


agaattgctt 


41400 


gagatggaca 


cattgcactc 


cagcctgggc 


41460 


agaaagccta 


ttggtcattt 


tgaacacctg 


41520 


actttcagcc 


tttcaccttg 


ctgcttagtt 


41580 


aacgtggaaa 


cccaacattc 


ttcctaactt 


41640 


ttcattccta 


gctaattcta 


catagttttg 


41700 


ttgaggtgtc 


tttgctaaac 


taatttactt 


41760 


agcaatttta 


cttcttgcca 


ctcgctcaaa 


41820 


aaattctttc 


aaggtctttc 


agttagaaac 


41880 
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tggcaaaagg 


aagattagat 


tctacaattt 


cttgtctcta 


attcccctat 


tatttgctat 


41940 


ttgtatcctg 


cagaaggacc 


ttacttttag 


tttacgaagt 


tctcttcatt 


ggtagatttt 


42000 


aagggcacaa 


tgatagagac 


aaaatctgga 


attggccaac 


ttgatagggc 


tgatttcgtg 


42060 


ggcatgggac 


ctgtgcagtt 


actcaggact 


tcacagttaa 


aagagccctg 


tgcttgatct 


42120 


aatgctctgc 


tgttgctatc 


ttaaaattgt 


ttttaaaaat 


tttaaacaag 


agaatctaaa 


42180 


ttacatagct 


ggtcctccat 


tcatctcaat 


cacctttccc 


agagtccatg 


atgaattcct 


42240 


gttaattgtg 


tgaacatctt 


tatgcctgtt 


cttatagcag 


aggacctcca 


taagggcttc 


42300 


ctgggatttg 


aagtaatggt 


tctattcttg 


cctgggcacc 


tgcacaatca 


gtccacaggt 


42360 


ctatatacat 


gctacctttt 


tttggaggcc 


cagccctgta 


cctgtttttg 


taaataaagt 


42420 


ttcataaata 


aagtttaatt 


gaaatacaac 


gaagcccatt 


tgtttacata 


ttgtggagtg 


42480 


ctgctttcgt 


gttataacag 


caggtttgag 


tatgtgtgac 


agagacctta 


tggtcggcaa 


42540 


aatctaaaat 


gtacattatc 


tggcctttac 


agaaagcttg 


ccgtcttcta 


gtctacactt 


42600 


ctagatggta 


aaatccttga 


agacagggga 


attgtctgtt 


ttgtttgctg 


ctctatcttc 


42660 


tgtgcctaca 


acaatgttgg 


aaacacagag 


gccactacag 


ctgttgaagg 


cacaagaaaa 


42720 


taaataagca 


atgaatgcaa 


atattctcaa 


ggtctcctta 


agtctgcctc 


actcatatcc 


42780 


tgagctcttg 


gtctattggc 


tcttctgcaa 


cagagggact 


atccctaacc 


atggagagaa 


42840 


aaagagagaa 


tatttaaacc 


agataatccc 


tgcatcaatc 


c 99999 aaaa 


ttaatagaaa 


42900 


taagtgtact 


gcccaatgtt 


ttccaaattt 


tgaccatgtt 


tcttggctga 


agacnnnnnn 


42960 


nnnnnnnnnn 


nnnnnnnngt 


ctggatggca 


tggtgctcct 


cttaagtgtg 


ctggaattct 


43020 


gcattgctgt 


gtccctctct 


gcctttggat 


gtaaagtgct 


ctgttgtacc 


cctggtgggn 


43080 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


gaattcagaa 


43140 


aagcagtggt 


tttggggcat 


atgatagcta 


ggtctctatc 


tcttcctgtt 


cttccaccaa 


43200 


- ccctatgtca 


cactgacttt 


gtggaagaat 


aaccagaact 


cttcttagtt 


ttgtgatgtg 


43260 


tgggccagtg 


tggtttttgc 


aatgtattat 


tcataactag 


caaatcatga 


tttgggtatg 


43320 


ttgatgtcat 


cagcatgaac 


aacctggttt 


ggaagaagag 


ccccatattt 


ttcataacac 


43380 


cttggtttac 


ccaacgtact 


gtgcactgtc 


atccacacgt 


aatattgatc 


cacttgcgag 


43440 


accagagcag 


tggtcatcag 


cagctcaggt 


caagagctat 


cactgaatgt 


tgtgtcctaa 


43500 


atcaaaccta 


atcaaQaoacr 


tqac tcracraa 


accccacat a 


cttqatcttQ 


acraatcraatc 


43560 


agagaagaaa 


agagtaaact 


ctgataatga 


tgactttatg 


tgggcaggaa 


aggggcactc 


43620 


tgaggtgatt 


tgggtgactc 


acctttggca 


ttctgattgt 


ttcacaggtt 


gtgttaattc 


43680 


tgccatcaca 


ttctcacatg 


gcagaaacag 


catctcccac 


accacttaat 


gaggtttgag 


43740 
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gccaccaaaa 


gatcaacaga 


caaatgctcc 


agaaatctat 


gctgactgtg 


acacaagagc 


43800 


5 


ctcacatgag 


aaattaccag 


tatccaactt 


cgatactgat 


agacttgttg 


atattattat 


43860 


tatatgtaat 


ccaattatga 


actgtgtgtg 


tatagagaga 


taataaattc 


aaaattatgt 


43920 




tctcattttt 


ttccctggaa 


ctcaataact 


catttcactg 


gctctttatc 


gagagtacta 


43980 


10 


gaagttaaat 


taataaataa 


tgcatttaat 


gaggcaacag 


cacttgaaag 


tttttcattc 


44040 




atcataagaa 


ctttatataa 


aggcattaca 


ttggcaaata 


aggtttggaa 


gcagaagagc 


44100 


15 


aaaaaaaaga 


tattgttaaa 


atgaggcctc 


catgeaaaac 


acatacttcc 


ctcccattta 


44160 


tttaactttt 


tttttctcct 


acctatgggg 


accaaagtgc 


tttttccttc 


aggaagtgga 


44220 








l — *■ — - — . V — 1 — I — 1 — k — ■ 1 — V — - 


r*t"t"c , t~fT , 1~cir* 

V— • Lw w L^ V^- V— L> 


ttttctttcc 


ccataaaaaa 

V- V—» CL w CL V*j CL UUM 


442 80 


20 


taccttgaag 


tagcacagtc 


cgtccttgca 


tgtgcacgag 


ctatcatttg 


agtaaaagta 


44340 




tacatggagt 


aaaaatcata 


ttaagcatca 


gattcaactt 


atattttcta 


tttcatcttc 


44400 


25 


ttcctttccc 


ttctcccacc 


ttctactggg 


cataattata 


tcttaatcat 


atatggaaat 


44460 




gtgcaacata 


tggtatttgt 


taaatacgtt 


tgtttttatt 


gcagagcaaa 


aataaatcaa 


44520 




a L. Lay day L.a 


CL \— CL «w U- I- L. CI L, 


eft" ori'ar'rha 


rt*t"cctt"ttc 


aaa t~ araata 


tacrtaciaaaci 


44580 


30 


agactaagcg 


ttgatgtcaa 


aaatacagta 


tggtggtaaa 


taatgtgggt 


tttctaaaca 


44640 




gacaaccctg 


ggttttcatc 


ttgctatccc 


atctttgttg 


tttactactt 


atgtagctat 


44700 


35 


gaagatgcca 


caaaatttct 


ctgtgcttca 


tatttctttt 


taggaaggta 


gtaataatat 


44760 




tgtctggccc 


atacacctat 


gataattact 


tggtataaca 


aaggtagaaa 


aaccaggact 


44820 






CL V— CL y y CL 


c accra a ant t~ 


ttaattaata 


tatatacata 


tatatatata 


44880 


40 


aatagatatg 


tattagatat 


atatatcaat 


ttatcatttt 


attcagcata 


tatctgtaaa 


44940 




gtatctgtaa 


tgaggcagac 


attgttttaa 


gcatataaga 


aaatagtggt 


aagagagata 


45000 


45 


atatcttata 


tatataagat 


aatagtggtg 


aataagagag 


agaacatcct 


tcctctcttg 


45060 




taggttaaaa 


cttttatggg 


acaggacata 


gattaacata 


taaacaatca 


aaaccttttt 


45120 




atatattatt 


a era a err* t* aar 


toaaaaaatt 

t— ^— j ^— j CL CL CL CL lw 


aatcrfccrcfcacr 


aaaataactcr 


ctataataat 


4518 0 


50 


ggtgagagat 


gagtcattag 


aatgtaccca 


ggaaagtttt 


ttttttgagg 


aagtgatatt 


45240 




ttcattatga 


taagataaaa 


attattgaat 


ttgeaeggat 


attegtattt 


tccaattcag 


45300 


55 


tatatagtat 


ttctagattt 


ctcattctgc 


cctacactca 


catatctatt 


gtatccacct 


45360 




ttctccatgc 


acgcttttca 


tgggaatgta 


agtccctcct 


tctatcagtg 


ctaacctgtc 


45420 




tactttctcc 


agctccaggg 


atggataagg 


tgtggtaaaa 


gtcagttaag 


acattctatc 


45480 


60 


ttcttggttg 


aaagttggaa 


ggtaattagc 


aagaaccacc 


aaatagaact 


taatactatt 


45540 
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gcttttcttt 


cttttttttt 


ttgaattttt 


ttttcttttt 


tttttgagac 


ggagtctcgc 


tctgtcgccc 


aggctggagt 


gcagtggcgc 


tatgtcggct 


cactgcaagc 


tctgcctccc 


cggttcacgc 


cattctcctg 


cctcagcctc 


ccgagtagct 


gggactgtat 


acaagcgccc 


gccaccgcgc 


ccggctaatt 


tttcatattt 


ttagtagaga 


cggggtttca 


ccgtgttagc 


caggatggtg 


tctcgatctc 


ctgacctcgt 


gatccgcccg 


cctcggcctc 


ccaaagtgct 


gggattacag 


gcgtgagccc 


ccgcgcccgg 


ccaactgtta 


atattttacc 


atgtcataaa 


gagatattgg 


tctatgtatg 


tacctaattc 


caatctttta 


attattgtgt 


atgtactgat 


atcctacttg 


ttttatttct 


tcaatattgt 


attaatactt 


ggcagatttt 


gtactcggca 


tttattcccc 


atttgttctg 


ataatgatgg 


cctgatttcc 


tccaaaaaat 


caccattatc 


taactcttaa 


tttatgtaca 


agggtggagc 


tcacccaacc 


cccaatttca 


ggagttgacc 


tgtgtcctac 


atctgggaag 


tggcagaatt 


acatcaactc 


agctagagtg 


aagttctttg 


gtagccttgt 


gacctaggaa 


aaaacaattg 


tgtgaatgac 


tgacacattt 


ccaatcaaaa 


ttaatctcat 


cattttcact 


tcactgacca 


agaaatggtc 


tccccatctt 


tcctctttcc 


tgttgggttt 


ggagctgtga 


ggacgtaagt 


ttgctgccat 


cagcagcccc 


gtgacagaca 


gatgctcaca 


gacaaaagca 


gaactgagaa 


gttccaagtg 


aacgatcaga 


ccctaatgta 


attcttttga 


gtctccaagt 


caagttttac 


ccggagttgg 


cccaacccct 


ggggttttaa 


atatataaag 


acaaggaaag 


agatttattc 


agatagtgca 


ataactgggc 


aaaatccttc 


aacggcttcc 


tctctaattc 


agaataaaag 


ccaaagctca 


tataaatcct 


acatagtcct 


tacatcacgt 


gattctttgt 


tgtttcttgg 


atttttaaaa 


tatcctgact 


tatatcctga 


tatattgctt 


tgagattgtt 


taaaatggta 


agagagtgcc 


ttctcttggt 


gataattagt 


gcatacataa 


tttttttaac 


acttacataa 


gcaacctaag 


attagtgcat 


ttttagttgt 


aaaaagttca 


tgcaaatatt 


ttgtgctatt 


agtgcaagat 


attatgattc 


caacttttag 


acaagactac 


ctgcacttgt 


ttcatgtcat 


gtgatttgaa 


atttattctt 


catttggtta 


ttcttggttt 


tatctatatg 


tgattagaaa 


gcagtaggaa 


aatgtccatg 


agactttgtc 


tccttttgct 


acttttacaa 


taaaaataat 


tattgtttca 


taggtttttg 


tttttttttt 


tgagagaggg 


tctggctctg 


ttgcacaggc 


tggagtgcat 


tagaacctct 


gcctcctgga 


cttaagccat 


cctcccacct 


cagcctccta 


agtaaatggg 


actacaggtg 


tgcaccacca 


a t~* /■* /**• a fr f~* f~ a 


LUtLUtL.Ld.1, 
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ttccctgtct 


gcttaatgat 


tgcaagaggc 


tcccctgctg 


ggttgaagtt 


gggccaagaa 


ttgccttagt 


agtggttgat 


acggttggga 


tggtatggtt 


aaagttgaac 


tcatcattga 


gggaagcatc 


ttgggaccgc 


acttcgcctt 


gtaaataatt 


ggaaaaaaaa 


agttataagg 
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tcttgagtat 


ataaagtgat 


attattgata 


acttctaact 


ataagtagtc 


caagctcttc 


aggtagatgc 


aagatatctg 


tgactaaaaa 


tgattttttt 


aaaaaatttg 


aatccagtat 


ttttgggttt 


tttttttgtc 


ttagactcac 


tgaggcaaaa 


tgaaggcttc 


acgctaagtt 


aaatttgtgt 


tatcaaatat 


ttctgtcccc 


tttctcctct 


aaaatctaat 


ttctcttctc 


taccagtcct 


aaatatcaaa 


gattataaga 


aaagtaagga 


taattaaaga 


ttaaagaata 


ttgaggcttg 


tgtatttttt 


cagtcaatga 


taaagaaacg 


ggcgatcaat 


aataagcaac 


tggtaagtgt 


actataatca 


gcattgctac 


aaaaactgct 


atctattgag 


aaaacatgca 


tccaactatt 


aaaagtaatt 


tatatttata 


aaatcaaatt 

« 


aaacccactc 


gtaaatgtaa 


gagaagtctc 


aaagggctca 


ttgttataac 


agttataata 


ggtattttta 


catttgaggt 


tgtaaaatga 


taatactgta 


tgcaaagtta 


gaggcaaatc 


aacagttcta 


acagtgatgt 


gttacactta 


cattactcag 


taaattggat 


tatagagcat 


ggaacaattt 


ggggaagaat 


catctcgggt 


gttcccactg 


accattagaa 


atctgtagtt 


tatcttaaat 


taactatttt 


taaaatttgc 


cattttccac 


aactcttact 


tttcctgatt 


ctgagtcctg 


atcccataaa 


cttgcaatac 


tctatggaaa 


ggatttcaag 


agagacatac 


ttttatcttt 


aataaaaaca 


tgaattaaac 


cagaaggaag 


gaaataaaga 


gttgaagccc 


actgctgtgg 


ctgctgaagt 


aatagatgca 


ttttatctta 


atacttgact 


agaaatttcc 


tgtatcttca 


ttttgtatct 


tattatgaat 


cttcatcatt 


aaaatgtatg 


atagctttat 


ggtaggtctt 


gaaatcaggt 


agcatatgtc 


tcatattatt 


tttcttcatt 


aacaaaattg 


atttagtcaa 


agtctttaat 


tatttcatgt 


acattttgga 


attacttcgt 


caactttatg 


caatccaaaa 


agaccttctt 


ggattttgat 


taagaaggta 


ttgagtctat 


aggtctataa 


gggaaaattg 


ccaacttaac 


aatattgagt 


tttctaatcc 


attaacagaa 


tatacctttt 


cctttattta 


tttattattt 


ttatttattt 


atattttttt 


gagatggagt 


ctcgctctgt 


cacccaggct 


gtagcgcaat 


ggtgtaatct 


tggcttacta 


caacccctac 


acctcctggg 


ttcaagtgat 


■ j j ■ 

tctctcacct 


cagcctccag 


gtacctggga 


ttacaggcac 


acgccaccac 


gctaggctaa 


ttttttgcat 


tcttttagta 


gagatggggt 


ttcaccatgt 


taaccaggct 


agtcttgaac 


tccgtaaatg 


cacagcttag 


tgttagaaaa 


caaatttaga 


atacttccct 


gcagatttct 


ggacctctgt 


ctctgcatag 


ctatcccctt 


tatggttctc 


ttctcactaa 


ttatagctgt 


ctcactgccc 


tcaaattctg 


gtttttgtct 


tctgaatttg 


gcaatatcag 


aatggatccc 


tctccctgac 


catggtctgg 


aaagtttctt 


catgcagaaa 


accagagcaa 


ttattatatt 


cacatttgtt 


ttcatttttt 


cagtgttgaa 


agtccggcac 


tgtttcccaa 


tgtctggaag 


cagttgtgct 
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taattttttt 


ccggttttat 


ttatgatata 


aaaaaatttt 


tgagacaagc 


tctaactcta 


49260 


acacccaggt 


ttgagtgcaa 


tggtgcaatc 


atagctcact 


gcagcctcaa 


aatactgagc 


49320 


tcaagggatt 


ctccaacttc 


agcctcccaa 


agtgctggga 


tcaaaggtat 


gggccactgt 


49380 


gaccagcgca 


gttttctaac 


tgaccagagt 


ggaagattaa 


gtctggttct 


atcataattg 


49440 


gaagagctga 


gtttcaactt 


taaaaagatg 


atcataggct 


gggcatggtg 


gctcacgcct 


49500 


gtaattccag 


cactttggga 


agtcgagggg 


ggcggatcac 


ttcaggtcag 


gagttcaaga 


49560 


ccagcctgac 


caacatggtg 


aaacctcgtc 


tctactaaaa 


aatacaaaac 


ttagttgagc 


49620 


atggtggtgc 


gcgcctgtaa 


tcccagctac 


ttgggaggct 


tggcaggaga 


attgcttgaa 


49680 


cccgggagga 


ggaggttgta 


gcgagccaag 


attatgccac 


tgcattccag 


cctgggccac 


49740 


agagtgagac 


tccatctcaa 


aaaaaaagct 


catgaacaca 


gatgttagca 


tttttatttc 


49800 


atttgtttat 


atttaaaatc 


gatttaagca 


tgtcaacata 


tgtaacttaa 


taaaattaat 


49860 


acttataatt 


taaaatattc 


aagcagaaaa 


ttgaagtatt 


gtccttttca 


tatgctgata 


49920 


ttttagtcac 


cttaattggg 


tttcataaga 


agcaactatt 


aaaagacact 


agcatgttgc 


49980 


aattattttt 


tatttaaatt 


aatttgtgtt 


attttgcaat 


gttttatata 


gtaatatcca 


50040 


gacagtttta 


atatacactc 


aaaagttttg 


ttttcacaca 


acataaatat 


aggaaacaca 


50100 


atgagacaaa 


gctatacaaa 


ttccagggaa 


aatatttaaa 


tctttgaatt 


tattaaaata 


50160 


tgtctctcag 


tttgaatttt 


gagtttactg 


taggaaaaca 


cagaaatgat 


gtgtaaagta 


50220 


acttcctttt 


gattgcctgg 


tggatagaaa 


aacaaaaaac 


aactaaattt 


gagatatggt 


50280 


ttgaacagag 


ctgatgatgc 


atttgttttc 


ctgtttgaat 


ctcctcaaga 


tcagcaattt 


50340 


gtttttttac 


tgactaaggt 


ttgttcagat 


tgctactgct 


catatgattc 


ccaggaactt 


50400 


aagtactttc 


gagtaactgt 


gtgaggctga 


taacactaga 


tttaaaaaaa 


atttgtcttc 


50460 


aataagtaca 


ctctttccag 


cagaggaagc 


agttcagaaa 


gttctttgtg 


gtttcccctc 


50520 


tggatctttc 


tatcatcaat 


ttcagcaaaa 


ggtggtattt 


ttcttttaat 


aaaaggtcaa 


50580 


gaccgcttgt 


tttaatcctt 


ctcagactcc 


aggtacagct 


taaccagata 


tagcctgtaa 


50640 


cttgctccca 


tctcttgaat 


gcatgctttc 


agcttggcca 


ggaggggctg 


gacatcaaaa 


50700 


ggaggagagt 


cacagtctgg 


atggggctca 


ggaacctcac 


agtgctgtat 


aagccagcta 


50760 


tcatctaggg 


gggttctttt 


ccccttaagc 


cagcagcaaa 


atgcttcaac 


ccttgtcaat 


50820 


uacL.UL.cagc 








LLL ay La. L LL. 




juoou 


ttagagaata 


ctgaaagcat 


tcccgtttta 


cctctaactc 


tgaagccaca 


gaaactgtgc 


50940 


ccataacagg 


gttgctaaat 


ggcctaaatg 


acatatatgc 


aaatatatat 


acacaaacac 


51000 


acatgtgtat 


ggaatatata 


tgtttgtata 


ttatttgaaa 


atatatttaa 


ataacttgca 


51060 
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tattacatat 


aaatttatat 


atatttcata 


tccatatatg 


catatacttt 


ttctcataca 


ttataaaact 


ctggcaacat 


ctacatgtaa 


catatgagtc 


tttatttgtt 


ttgctgacaa 


agattatctt 


tcccagaatc 


tggattagaa 


aatttagtta 


tttagtattt 


ggttgttatt 


atttcaatat 


tttgatttaa 


tttattttgg 


aacagaaatt 


agcctgcaaa 


ttctgcaaat 


ttttcttaga 


aatgtatcat 


atgcataaat 


ttgttttagc 


tgacctagaa 


aacggagggc 


ttatatttcc 


ctttttattt 


ttctatataa 


aagatatttg 


atttaagaag 


aaaatcagtt 


cgaagtcttt 


gtaaatctga 


tatattctct 


tttcccaatt 


gtattgccca 


agtcagagtc 


tattattttc 


cacaggcaga 


tatttgaaag 


gacttaacaa 


ataccacaga 


aggtactcac 


ataaggcttt 


tatctaaagg 


cagagagtac 


aatgcagcaa 


gagagagagg 


gcaagcactg 


gagacattcc 


aagcagtaca 


agacattcca 


gatagtacaa 


gctccccata 


gacctaacca 


gacacatgaa 


ccatgccagt 


ttctgacttg 


aactgcccag 


atctatgcaa 


gaaactctct 


atatcccatt 


aagtagtagt 


caagcttgtc 


aaactcatta 


ggctagttgc 


aagtcattag 


caaagaaact 


gaccttgggg 


cttgccaaga 


gagtttaaaa 


ttattctaaa 


ctgcatatta 


taaattctaa 


tggtcttatc 


ttgaccaaga 


gccaaaattc 


atacattcag 


gcatccccag 


gattaagaca 


gagctcttaa 


gtacctctgt 


aannnnnnnn 


nnnnnnnnnn 


nnnnnnnnn 



<210> 77 

<211> 84248 

<212> DNA 

<213> homo sapiens 



51959 



<220> 

<2 21> MS4A4E_initial_coding_region 

<222> (18861) . . (19004) 

<220> 

<22 1> MS4A6A_initial_coding_region 

<222> (67182) . . (67328) 



<220> 
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<2 21> MS4A6A_coding__region 
<222> (68944) . . (69078) 



<220> 

<221> MS4A6A_coding_region 
<222> (70593) . . (70649) 



<220> 

<2 21> MS4A6A_coding_region 
<222> (73298) . . (73507) 



<220> 

<2 21> MS4A6A_coding_region 
<222> (75780) . . (75881) 



<220> 

<221> MS4A6A_coding_region 
<222> (76656) . . (76748) 



<220> 

<221> MS4A4E_coding_region 
<222> (42168) . . (42236) 



<220> 

<221> MS4A4E_coding_region 
<222> (40220) . . (40321) 



<220> 

<221> MS4A4E_coding__region 
<222> (35782) . . (35940) 
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<220> 

5 <221> MS4A4E__coding_region 

<222> (34287) . . (34343) 

10 

<220> 

<221> MS4A4E_coding_region 

15 <222> (23822) .. (23950) 



20 



<220> 

<221> genomic_DNA 

<222> (1) . . (84248) 

25 <223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 



30 



35 



40 



45 



50 



55 



60 



<400> 77 
nggataatat 


taggctaaat 


acctaatgta 


gatgacaggc 


tgataggtgc 


agcaaaccat 


60 


catggcacgt 


gtatacctat 


gtaacatacc 


tgcacttctt • 


cacatgtatc 


ccagaactta 


120 


aagtataaaa 


aaattaagaa 


atttttaaat 


aagaaacctg 


aaatgtagat 


attctatagg 


180 


aaaagtgaca 


tggtttcttt 


aatagataaa 


tgacatgaat 


aaaaaagagt 


gggaagggga 


240 


aaatatagat 


tgaaggagac 


ttaattaaca 


tatcaacaaa 


atgcaatgtc 


tggcctttgg 


300 


atggatgggg 


ctggggtgtt 


cttgacttaa 


gaaaatgaac 


aaatatgtat 


gagaaaataa 


360 


taaaatttga 


atatgaactc 


tatattcgat 


gatactaagg 


aattattgat 


aatgctgtaa 


420 


aatgtgataa 


tgatattaga 


aaatgttatg 


gaataagtgt 


catctgttaa 


tgatccatac 


480 


caaggtataa 


actagtgagg 


tgatattaaa 


tgtgagattt 


actttaaaat 


actcttacaa 


540 


atatgtgcag 


ggtggagaga 


gatgagaaaa 


cattggcaaa 


ctatcgataa 


ttgttgaagc 


600 


agactggtgg 


atgtatggta 


attcattata 


ctaacctccc 


tacttttgtg 


tgtgttttaa 


660 


attttccaga 


agaaaaagta 


aaagacagaa 


agtggtgatg 


gtttcatgag 


tgtatattta 


720 


tctaccaaat 


gatcaagttg 


tatacattaa 


atatatacag 


ctttttgtat 


gccaattata 


780 


cctcaataaa 


gtggattttt 


aagacagata 


agaaataaaa 


ataagataga 


taaaaactac 


840 


acgtgattgt 


gaaattaatg 


aaatgagtgt 


ttttagctcc 


atcagttcag 


tttggttctt 


900 


tcttaaaata 


gcaatttcat 


gttttatcta 


ctgtactatt 


ttattgtatt 


ccttagattc 


960 


attggattag 


gtgtagactt 


tctgctgttg 


atgatctttg 


ttcctatcca 


tgttctgaat 


1020 
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tctacgtctg acatttcagc ctttcagcct 
agtaatttgg agataagaag agactctagc 

5 

cagtcatctg tttgtgctga tgttccttca 
tttttttttt cttttattgc ctttgatgcc 
10 tttagttgat tggctttgat tctagtcagt 

gtattggatg aggcttccct gattactgtc 
ctggtccaca gaactccctt aggcaggggc 

15 

gcttagcgct aatctgctgt ctgaatgctt 
tggagtttag gtagaagcag gaccactggg 
20 caaagattgg tggagttgcc caccctatca 

cacttcttgg taatttcagg cagaaagaag 
tgtggctcac ctgtctacca gtggtaggag 

25 

tgcttcccag ggaaacagga ggctgcacct 
ctaggccaga actctagcag gcattgccca 
30 tcacctgccc tgctgtcagg gtgttttctg 

cctaaagact gcaatttctg gctgagttca 
ctagcaagca ttgcctgccc ggctaccagt 

35 

tgggtgcttt ctgggacaag agtaggctat 
aatgctgggt aggaaggtgg cacaaggaat 
40 acagtcccta atcaagagag gcaagagcta 

gactgcaagt gtcttttctt ggtagaaggc 
tgtctggtag tttgtcattg ataccttctg 

45 

ttctaatcct acacaagcat tttttaactc 
atcatcttat ttttatcaag aatattgctt 
50 gtattctgga tatatatcct ttcttaatta 

gggttgtcaa tttttgctat gtatatttag 
aaatgattac atctttctga agattgtact 

55 

tcatcagtaa catattttgc tctaaggtca 
ggtttgctcg atatatcatt tctcatcctt 
60 ttacttcttt ttaacagtaa ttagttcagt 
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ggttaaaaac 


cattgccgga 


gaactagcgg 


1080 


tttttgagtt 


gccagttctt 


gcacagctct 


1140 


atctttgaag 


ctgctatctt 


ttagatgggc 


1200 


cttgggggtt 


tgattgtggt 


ataagttggg 


1260 


agactggctt 


tgatcctagt 


cagttcctgg 


1320 


tctctgctca 


catttattat 


gttgggcatt 


1380 


cacagttggc 


ggacaagtag 


tatccttaca 


1440 


tccagggtaa 


cacagtgttg 


tccccaccta 


1500 


ctataagctg 


gggtgtctgg 


ttacaaaagg 


1560 


tctgggtgtt 


tctggggcaa 


gagaatgctg 


1620 


tagcactgct 


gagctggaag 


ctccagcaga 


1680 


tgggtagggc 


tgtcctctct 


ggcatctgga 


1740 


gctggctcag 


ttcacagaga 


agtaggacca 


1800 


ccaggctacc 


agtggcaggg 


ttgggttgca 


1860 


tgattacagg 


aggctgtgcc 


ctctggcaca 


1920 


catataagaa 


ggactactgg 


tctggaagct 


1980 


gtcgggggtg 


cattgcttgt 


ccttcttgtc 


2040 


ggctactggc 


caagtccagg 


cagaagtgga 


2100 


attatgtgga 


gattatttca 


gaaccctaca 


2160 


aaggaatcag 


gaaccctagg 


agggctatgg 


2220 


ccaacagtta 


gctcaattgt 


ctgtgttaaa 


2280 


ttacataccc 


tctatgctcc 


atcaatcgta 


2340 


tgtatttctc 


ccacctcctc 


aagccaagcc 


2400 


acagcctctg 


aatgttccct 


ggttttttat 


2460 


catgtattaa 


aatattatct 


aatattctgt 


2520 


aggttctgtt 


attaaatata 


tacacactta 


2580 


atttttataa 


ttaaaaatta 


atccctttgc 


2640 


attttctctg 


atcagttttt 


tttgggttaa 


2700 


gtactttaca 


gtttttgtat 


gcattaaaat 


2760 


tttgaccttt 


taaaatttta 


tctgaaatat 


2820 
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ttgtatttta 


ttacatatga 


cgtatttatt 


tatatatttg 


ggattaaatc 


taccatcaca 


2880 


ctattgactt 


tcctggagga 


aataaatgtt 


ctaaatgtta 


ttagaatttg 


ataaatcaaa 


2940 


tacacatgct 


ttaattttta 


ggattacaag 


aacacacaat 


gaaagacttt 


ataagtacta 


3000 


aaattaatta 


gagggagaaa 


taatgcaata 


aaggggtata 


tagaaccttc 


aatcaaaaaa 


3060 


atagaaaaca 


tttttcgttt 


gttcatttgt 


ttgttgtatt 


tggtttctct 


gttttacgat 


3120 


attgattttc 


ttaaatattc 


agtgttcatt 


tacactttca 


tttggtaatt 


tctcctatca 


3180 


tgtcagggaa 


tgtaggtctt 


gcttaccata 


gcttacatcg 


ggtgactgtg 


agagaagagg 


3240 


gggcataaag 


ccaggttata 


catacagatt 


ggttgtgatg 


tagttgagag 


aaagcccctt 


3300 


ccttctccag 


actaaggtga 


cctctgtgtc 


ttcatcatca 


ggatcccagc 


tctaaagagt 


3360 


gagcgcagga 


ctttcatttg 


ggaatcaatt 


ccataatatt 


ctctgctttt 


tgcttttcct 


3420 


ttataccttg 


tgaatggaag 


cttagacctt 


aattcttgtc 


tctttcctgt 


tttttactca 


3480 


gagctggcac 


tcacattcta 


tatacttgtc 


tagagcagaa 


ccctaaaaca 


gttttttggt 


3540 


ttggtttttt 


gctcataaaa 


cagttctcat 


attttggttt 


ggggataaac 


taccttgtta 


3600 


ggtgatgcct 


gtgtgaggga 


cagaggaaca 


actgaccccg 


cttttccaat 


ataattttaa 


3660 


aaattaatta 


acaactgctc 


tgggctctga 


cctactctgc 


tgcttgttgt 


tttgacactg 


3720 


aacttgagct 


cttttttatt 


tgcttttatt 


tgtatagtag 


ttagttttgt 


cctatgctta 


3780 


tttagaggta 


gggtgattct 


atttatgttt 


ctctgtcaaa 


tttcaatttt 


attgttcttt 


3840 


atatcaccat 


aaatcctcaa 


tcacttgtct 


gctaagaata 


acactaataa 


tatctcactt 


3900 


cacactatat 


ggaagaggag 


gtaagattga 


catctgttct 


tagtctagaa 


tttcaacctc 


3960 


tcttataaag 


tcatgaaact 


ctgtaccttt 


gtcttctgta 


atgccacatt 


ttcctagttt 


4020 


gcctgttatc 


ttctttaatc 


tctttttgcc 


atcctctgga 


aacccttatt 


tctgataata 


4080 


ctgtttctat 


tctctagcaa 


aaagggtctg 


attttttctg 


cctcctccca 


caagcctggg 


4140 


taacctcatc 


cctctgggtg 


aagagctgtt 


gggtttacct 


ctacagccat 


gttctcattc 


4200 


cctagctcct 


gccagttatc 

* 


caacttttgt 


gaaacttatt 


tgaaggtgcc 


cacacaattc 


4260 


aaattaaata 


tgctccaagt 


cttactcttt 


gttcctgcag 


ttctgtcttg 


gtccatgtct 


4320 


attgccttac 


tgaatggctc 


tatcattcat 


tcagtttcct 


aacccagaat 


ctgagcgcca 


4380 


tcctttcctg 


ttttctctcc 


tgatttacac 


atccattata 


tcaacaagta 


ctgtcatttc 


4440 


t^^y i— »~ v_- \_» cia. 


cx v_ cx * — u i — v_ \— \ — • a. 


ca. d v_» ci ^ — \— v> c*. 


ttctctccaa 


cttcactaac 


a t cataacrt a 


4500 

•4- * V» V 


gttcagtaac 


catcatccct 


tccctagaga 


actgcctctt 


tcttaaccct 


aactccctga 


4560 


tttcagtatt 


atttattcct 


attgttgctt 


tacatgaaaa 


ccaggacaat 


atttctgaaa 


4620 


tataacccat 


ttaattctga 


gaattaaaat 


aatttcagat 


tatagacaac 


aaagtttttg 


4680 
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ggattaggca aaactggatt tcaatctcat ctataccagc tatatgaata tctctgagtt 4740 

tcagttccat cttctggaaa atgggaataa taatgttacc tagctccatc cttcttccat 4800 

ttgtgtgtct tttgcttatt catcaggtct ccaattagat gtccctttat ctaggaagtc 4860 

tcctctgcct acccaacgtt gagttctttg atcctcctat gagctctctc aggatatcca 4920 

attttacctt gcagatcact gattataatg ggttgtaatg atttatttat tcatcagcat 4980 

tgccacttta ccttgcaggt aactgattat aatgggtgat ttatttactt gtctatattg 5040 

tcaagtagcc tataaaatcc ttgagcgaaa atcatttttc accattttat ctgaagcaca 5100 

tggcacactg cttggcttat agtagctttt ctctaagaaa atgctgagtg aatgaaaatt 5160 

gcatgccact ttgctttcct ccatgtaaag aaggcattat tttattttac caatgaatca 5220 

ggttcagggg attaacaagg tcatacaaga aatgattgcc attgggtttg tcttagtcaa 5280 

agaccagctt gttctatgat accagctgct tctttgactt agaaaggtgt tgtattaatt 5340 

attgctgtac aaccaacttg ttcccatttt atggctattc atgttgaacc tccaaaggtg 5400 

aaatatcatg aaaataagat atttaagttg tgtatattaa ggcacaatta ctttttcacc 5460 

aacctaatac aatagtaaca gtagttctct gggaagaaga gtactgtgga aatgtgtgaa 5520 

attgtgaaag ggggcattta taaattacca tttacttagt tgtgtatata acaatctttg 5580 

agcagagtct ggcaaacaga aaatcctcca ccactgatac ccacagtaat tagcactatt 5640 

attttcatag gaaagactgt atcaggcagc cactgctttt tcagcaaatt tcaaaataaa 5700 

cgtccacacc catattggga acgcaaaatc agttaatgct tccgataaaa aagaggaact 5760 

agttgtgtgt ctcagaagtt ctcaacacat cctttaaggt tccaaacatc tactagaaaa 5820 

ggagtgcaga tttaaactga gtgaggtgtg gagtggggga agctgactgt gtctggaaca 5880 

aagaactttg aggaacttgc ccagagccct gcactcatca gacctgcagc agacattgca 5940 

ggcctgaaga aaggtaggtc cagggacttg ccttcctagg ctctcggact gatagtttgt 6000 

ttcatttcct gggtttagga tttctgggag tggagtgaga aaaaaaaatg aggtcacttc 6060 

ccatgaccca agctggtagt cagggttggg gtggattgag tgcacattta aatatgcatt 612 0 

tggtgagggt tgtgggtggt tgaggaacag tcttcgctac agttgatggg agttagaaaa 6180 

gcctcctaaa ggctgacact ggaggctgga tgaagttgtt ttagttactt ttgttccgta 6240 

agttctcagg ataccaaaac actgtaagca acattttaaa ctttgattaa ttacctcatc 6300 

tactgttaat gcacttcttc catggtcttt attctttgtc tctgactggt tatttttata 6360 

attcaagact taagaaaaga tctagcacgt gcagaatttg gtgtacatgg agtagggcac 6420 

aggcagggtc cccagccttg tacatttaga tgggctgtgg cctgacgtag ttctccagtg 6480 
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gtctttggca ctattcatgt ggcacccagg 
tgcaaatgaa aaacatcttg tagactttta 
5 gtagtcgtag tcattgtaat ctttccccag 

taatgtacct agaatcacac aagttaacaa 
tggttcctag cccaggcctt tccccctttt 

10 

atcatcctcc taaaaaccat ccagattcag 
aactctgcca agaaaaaggg gatgaccgtg 
15 gtgacccata agggggatgg tgacataaaa 

gatccacaag gagactgggg ccaatagaag 
tttatccggg gtggggagga ggggcgggga 

20 

tataggtgat gtggcaaaac caaggtttag 
ccctcatatg tatccaatat tctctgcagt 
25 tggagtcttg taaacacacg ctttctttcc 

cacatggatt tctgtggtgt aaagatagga 
cttataaaga cagagcacca gccttgaatg 

30 

tatggaaaat cataagagtc aattataagc 
gaagcttaca gccttacccg agaggcacta 
35 gataagacag acccaacatg tcctaaattc 

gccgagtttc ccttacatta aagaataggc 
gcactttggg aggcccaggt gggtggatca 

40 

aacacggtga aatcccttct ctactaaaaa 
gcgccggtag ccccagctac tctggaggct 
45 cggagcttgc agtgagccga gattgcacca 

ctccgcctca aaataaataa ataagtaaat 
tagatggact aagaagaatt atcttcagtt 

50 

taaaaggaaa ataacataga tttcttaaat 
acatgcggga aatgggggct gtaggagaat 
55 aactgttagt gtgtgagaag attagatgct 

ggtcatggcc gggggagtgg aacattcctg 
atataggaat gaaattactt tagaacaact 

60 

aaaataaaaa ggagaaattc aaggtaaggc 
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cagtaaactg 


aggctgctca 


tgataaaatc 


6540 


gcattcttag 


gataagaaag 


aagcttgtaa 


6600 


atgaagtaaa 


tgaaatccag 


aaaagggaaa 


6660 


aagggctagg 


agcaaaaaac 


ccagaattct 


6720 


cccttcccat 


gccaccattt 


ctgcatcccc 


6780 


gcctcatttt 


gggtttctag 


tctatttgat 


6840 


tataccactc 


tcctcatgct 


ctcttgtgtt 


6900 


atgaggaaac 


aagagactgc 


tggtgctgct 


6960 


agggaagggt 


ggtctaggtt 


accccacctc 


7020 


agacattgta 


gttgtcaaga 


acctgatgca 


7080 


attcttctcc 


aacctgaaag 


aaatgtccct 


7140 


taaaaaatat 


atattttgat 


tgtagagaag 


7200 


ttggggacct 


tatctaccct 


agataattaa 


7260 


ggcagttata 


taaatttatg 


tcaagaatgt 


7320 


accagaacct 


gacctatgtt 


accaaagaat 


7380 


cttaatttga 


tataatggct 


cggccctcag 


7440 


aacacagagg 


gagaactacc 


aagtgtgttt 


7500 


tcagtctgaa 


ggtgtgcgtg 


agagattgag 


7560 


cgggtgcggt 


ggctcacgcc 


tgtaattcca 


7620 


cgaggtcaga 


ggatcgagac 


catcctggct 


7680 


tacaaaaaaa 


ttagccaggc 


gtggtggcag 


7740 


gaggcaggag 


aatggcgtga 


acccgggagg 


7800 


ctgcacgcca 


gcctgggcaa 


cagagcgaga 


7860 


aaataaataa 


ataaataaaa 


agtacacatt 


7920 


tctttcaaaa 


taactttggt 


ttctgtggtg 


7980 


ccttagttac 


cactagaggt 


tgagtaatcc 


8040 


ggtgtctgag 


agaagacaga 


aactatgcaa 


8100 


qtqqtqgqtt 


tgttgctatt 


cttgtaaggt 
— > — i — ' 


8160 


caaacgggct 


caacatatgc 


aaatgtctca 


8220 


taaataagtc 


caatatactt 


ggggctttaa 


8280 


tacgataata 


gatatgggtt 


gaagtagcag 


8340 
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caatatcaac 


aacattaaat 


ttttatcttt 


atactaaagg 


tggtgaaatg 


caaagaaagg 


8400 


actttaaagg 


ggagggtgac 


acagttgtct 


ttgcatttga 


tcatttggct 


acaaagtcag 


8460 


gggtgggttg 


aacatccaag 


gctacaaatg 


gacaccgggg 


aacattcaga 


ggctgtaagc 


8520 


aatactcctg 


ataaaaatag 


gatgatggct 


tggcacgcgg 


agatgggaga 


aatacagagt 


8580 


taaaaaatgg 


ttgcgtagaa 


ttagaatatg 


attgatggag 


catagaaggt 


atgtaacgga 


8640 


aggtatcaat 


gacaaactac 


cagacattta 


acacagacaa 


ctgagctaat 


tgttgggcct 


8700 


tctaccaaga 


aaggacattt 


gcagtcgcat 


agccctctta 


gcattcctgt 


tacctttagc 


8760 


tcttgcctct 


cttgattagg 


gactgcttgt 


agtgttctga 


agtaatctcc 


acataatatt 


8820 


ccttgtgcca 


ccttccttta 


tctgaaagat 


aaattttgag 


gaagactgaa 


tatcttctaa 


8880 


tttggtcttc 


taatgtttga 


aataaaaatt 


acctcttctc 


tttgctggaa 


caatttgata 


8940 


aatgatcaaa 


ttatgggctt 


tattatagtg 


acttcaaaat 


taaccccaga 


gggggacaat 


9000 


tctctcaagt 


tcagcaaaat 


ttgagggtat 


ggattcagtt 


taagaccaaa 


aataaggcct 


9060 


gctattttat 


tttattcgtt 


tgctccaaat 


ctaaattttg 


tattctgttc 


aggggatgct 


9120 


gaagtctact 


tataggtttg 


attataccta 


ctcaataaaa 


tgtaaatgta 


agtcacctca 


9180 


taaactccat 


tatcattaga 


acagagatta 


caagaggaag 


aggagggcct 


cttggcccac 


9240 


ctctacatag 


tcttgtatct 


acctagtcca 


tgagagcaaa 


aacaagcaca 


accctgactc 


9300 


ttctgaccaa 


tctttgagac 


aaactgcaaa 


agagaggaag 


agtcaacctc 


tgacaacacg 


9360 


gtaccatgaa 


taacatgctc 


tcaccagtta 


tatagacccc 


tcctttctgc 


tttttcactt 


9420 


cccccaagac 


aggacaggtt 


ggagcagggt 


catgccagga 


ataaagcccc 


tctttagcat 


9480 


gtccttgagt 


ggtaggaagt 


gttagaggaa 


atttatgttt 


tatacttcaa 


gccagaaatg 


9540 


agtaagtgat 


agtaggaaga 


ccaataataa 


tagtcaatag 


gtagttagta 


tttattaagt 


9600 


gacatgaact 


ggttgaaatg 


tttacgtgtg 


ctatttcatc 


attaaatttc 


tctacgagat 


9660 


atgctgtatt 


attcacaggg 


tagagttaca 


gaaacacaag 


attaattaac 


ttgagtaagg 


9720 


agtggcaggg 


ccaaaattta 


aatccgggta 


atttaattcc 


agaactcgtt 


tgcttaagag 


9780 


gatccactac 


actagtctgg 


cgaatagaag 


aactctttta 


tttgcacatt 


tattcattcc 


9840 


tcagttgttg 


gacatttggg 


ttgtttgcac 


cttttgacta 


ttataggtaa 


tactgctagg 


9900 


aaaattcatt 


ttacaagttt 


ttgtgtggac 


atatgttttt 


atttctcttt 


gtatgtatct 


9960 


gggaatggaa 


gggataggtc 


gtaaggctgg 


ttcatggttg 


acattttgca 


aagaatcagt 


10020 


tagggaggaa 


tcctccttat 


gttttaggaa 


tagtttcagt 


aggattagct 


ctttggtatt 


10080 


tattagctct 


tctgtggtat 


aaacacttct 


ttgtacatgt 


gttagaattt 


ggatgtgaat 


10140 
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ctatagggct cagtaatcta atcttccacc ttaataaact ggaaaaagaa gaacaaacta 102 00 

aatggaaaca acttgtagaa aagaaataat aaagactaga gtgaaaataa tagagagtag 10260 

aaacacatta gagtaaatca acaaaaccca aagttcattc ttgcaacgat caacaaattt 10320 

gaaaaacctt tagctatatt gaatgacaaa aattacaaga agaaatcact aaaatcagga 103 80 

atgaaagagg cagcataatt accaaactta cagaattaaa gaaattataa gggatactat 10440 

aaaattatat ccaaataaat cagataacat atgaaatggt caaattttta tgaagatact 10500 

actaaaactg actgaagaag aaatacaaaa ataagaagaa atacaaaatc tgaatagatc 10560 

tgtaataaga gattaaatga ggaattttaa aaacttttat aaaggaaagc ccaggctcct 10620 

gtgatttcac tggtgaattt tactaaacat tttaaaaaga attaataccc ttccatcaaa 10680 

aactagaaga gaagggaaca attccaaact tattacataa ggctagtatt attttgacac 10740 

caaaatcaag aaaagacaac ataagcgaac aaaactacat atgaatgtct ctcatgaata 10800 

cagacacaaa tgctaaaaaa aaattagcaa gcagaattcg gcaacatgta aaaaagatta 10860 

tatatgagga caaagtcaga ttaatctcaa gaatataaaa tttagcattc aaaaaatcag 10920 

tgttacataa tgcattaata gaataaaaga caaaaaccgc aaaaagccac cacgaatgcc 10980 

tcaatagatt aaaaaattaa caaaatccaa tactttttat gataaaaata ctcatccaat 11040 

gacgaataga atggaatcac tcccccacca aataaaaagc tattagaatt aatgaatgag 11100 

ttaagcaagc tcgaaggata caaataatat acacaaatga acagacactt caaaataaga 11160 

tgtacatgcg gccaacagtc atatgaaaaa atactcaaca tcactgctca ttagagaaat 11220 

acaagtgaaa accataatga aatgccccct tacaccaatc agaatgacta ttaacaaaaa 112 80 

gtcaaaaaaa taaaaataac agatactaac gaggttatgg agaaaaagaa atattaatac 11340 

actgttggtg tgagtgtaaa ttagtttaac cactgtggaa cactgtggtg attcctcaaa 11400 

aacctaaaaa cagaaatacc attcaaccca tcaatcccac cgtgggtata tacccaaagg 11460 

aatagaaatc attctatttt aaagacacat gcacacatat gttcattgct gcactattca 11520 

cagtagcaaa gacatggaat caacttaaat gtccatcaat agtaggctgg ataaagaaaa 11580 

tgtggtatat atacaccatg aaatgctatg cagccataaa aaaagaacaa gatcatgttc 11640 

attgcaggaa catggataga gctggaggcc attatcctta gtaaactaac acaggaacag 11700 

aaaaccaaat actgcatgtt ctcacttata agtggagcta gatgatgaga acacatgggc 11760 

acataaaggg gaacaacaca cactggggct tattagaggg tggaggctgg gaaaagggag 11820 

aggagcagaa aaaataaatt ataggtacta ggcttaaaac atgggtgata tggttgggcg 11880 

cagtggctca tgcctataat cccagaactt tggaaggctg aggtggatgg atcacctgag 11940 

gtcaggagtt caggaccagc ctggccaaca tggtgaaacc catctctact aaaaatacaa 12000 
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aaattagccg 


ggcgtggtgg 


tgcatgcctg 


tagtcccagg 


tattegggag 


gctgaggcag 


12060 


5 


gagaattgct 


tgaaccctgg 


aggeggaggt 


tgtagtgagc 


tgagatcaca 


ccactgcact 


12120 


ccagcctggg 


tgacagagca 


agacteggtt 


tcaaaattta 


aaaaaaaaaa 


aaacctggat 


12180 




gataaaataa 


tctgtaaatc 


aaaccctatg 


acacaatacc 


tgtatgacca 


acctgcacat 


12240 


10 


gtacccctga 


agttaaaata 


aaagttaaat 


taaatatata 


tacatacaaa 


aataaattgg 


12300 




agttctatat 


gctagcactg 


aataatttaa 


aaatgaaatt 


aagacaacaa 


tctcctttac 


12360 


15 


aatagtttca 


aatagaacaa 


aatttatcaa 


aataagcaca 


agacttgtac 


atcgaatact 


12420 


acaaaatgtt 


aaaagaaatt 


aaagaaaatt 


taaacaaatg 


aaaaaaaact 


catattcagg 


12480 




Cl CA. ^ y Cl Cl^J ^-j 


cttaatatta 


otaaaataat 


aatattattc 


aaattqacct 


acaqattcaa 


12540 


20 


cacaatcttt 


ataaaaaatt 


caagcaacct 


tatttcagaa 


atttgeaage 


taattccaaa 


12600 




atcatatgga 


aatacaaggg 


actcagaata 


gtccaaacaa 


ttttgaaaga 


gaaaaatagt 


12660 


25 


ttggtgcact 


tacactttcc 


aattcaaaac 


ttactacgaa 


gctataattt 


taagacagtg 


12720 




tggtattggc 


ataaggacag 


acatatagaa 


caatggaata 


gaattgaata 


aaattgtcta 


12780 




Cl Cl Cl Cl L» CL CL Cl Cl Cl 




taaaatcatc 


taaaaat aaa 


cctctatatt 


tatcractaQt 


12840 


30 


tgatttttga 


caaggataac 


acaactacta 


agtgagaaaa 


gaatagtctg 


tttagcaaat 


12900 




catgtttgaa 


caacttcaga 


ttcacatgea 


aaagaatgaa 


gttggacact 


ttcttacacc 


12960 


35 


aaacaaaaca 


gctcaaagtg 


gagcaaaaac 


caaaatgtaa 


gacctaaatg 


taactaatac 


13020 




ccttctattt 


aggtatattg 


agacctatat 


gtagacttta 


gcatcctgtg 


agcagaacac 


13080 




aaat"1 _ "r"t"a era 

Cl Cl Cl 1 — 1 — 1 — 1 — ClvJ Cl 


tataaaaraa 

C*. ^ C4- C_-l ^-j *w Cl 


aaaaataaaa 


taaaatacrca 


acrccttat 1 1 

>fc^ ^w- W«- Ikrfl V_- V—- 


ttqqtcttaa 


13140 


40 


atttaataca 


taccttccaa 


ttttgttaaa 


cttgggagaa 


ttgtcctcct 


ctgggcttaa 


13200 




agtcattaca 


aaaaagecca 


tagtttgatc 


atttaccaac 


ttgtcccgga 


agggagacaa 


13260 


45 


tttatttcaa 


acatttgaat 


acctaaatag 


aagatatcca 


gtcttcctca 


aaatttctct 


13320 




ttcaaaaaag 


taaggtggtg 


caagggacag 


tatgtggaga 


ttatttcaca 


accctacaag 


13380 




t~acrt~r , cfr , t~aa 


tcaaaaataa 


caaaatctaa 


aaaaaacaaa 


aaccc taaaa 


acrcrc tatqaa 


13440 


50 


actgeaagtg 


tcctttccct 


gttgaaagee 


caacaattaa 


ttactttgtc 


cgtgttaaaa 


13500 




atctggaggt 


ttaagatcag 


tcactgtggc 


tcacacctat 


aatccaagca 


ctatgggagg 


13560 


55 


ctgaggtaag 


aagattgett 


gaccctgtga 


atttgagaac 


agectgggea 


acatggcaaa 


13620 




actctgtctc 


tacaaaaagc 


acaaaaatta 


gctgggtgtg 


ctggtgtgca 


cctgtaatcc 


13680 




cagctacttt 


ggaggctgag 


gtcaaggctg 


cagtgagtca 


tgataacgee 


actgcactcc 


13740 


60 


agcttgtgtg 


atagagtgag 


accctgtctc 


aaaaacttct 


ggtagtttgt 


cattaatgee 


13800 
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ttctgttacc tacactgtat gctcaattgc 
tgacagtttt tgattagaag tttattttat 
5 tgactactat ttgcatggag tatttcatcc 

gctaaagtga gactcttata gcatatagat 
tgattgaaga atttaattcc tttatattta 

10 

gtgccatttt gttatttggg tggggggggg 
atctcttgtc tccctttgtg gttaggtact 
15 tatctttcat gtatctgctg cagttctaca 

aagggcatgt atgacataaa cctctggtga 
gtgcacaaag caaagagaga gagatggggc 

20 

tgccatagct taccctctcc catgataatg 
ctgggtgtaa ttacctctga aaggtcacac 
25 tttcacatga gttttagagg ggacagtgaa 

actcatgttc ttctcacata caaaatacaa 
tacccattct agcatcaagt ctaaagttct 

30 

tgagattcaa ggtatgactt atctggaggc 
aaaaccagtg atttgtttcc aaaatataat 
35 gttcaaaaga gagagccagg caagaaaaag 

caacagagaa aacagaatta tgttttaaat 
ctgaatcctg ggagtgctga ttggggccct 

40 

ggactcagtc cgtgcttcag ctctgttagg 
ctgagatctc aaaggtagcc ccactcccaa 
45 tctctgtgtt ggctctgact ccgcattccc 

aagtggctcc actcccatag ctatatgtga 
gcccctacat ctcttgcagt ctgtgagcct 

50 

ttgtgccttc caggcacagg tattccaggc 
tcagggtctt gcaaaactgc tgtggcatat 
55 gatgcaggtt acccacaaag ctaaggctgt 

ccatggagca gggatggaat tgagactgat 
tccagggaaa gaggggtgct ttggaggctt 

60 

tttggttcca gagacaatcg ggcatagcag 
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tcattaaatg 


atgaccttct 


ttgcctcttt 


13860 


ctgacctaag 


tacaggcagt 


ccttctcctt 


13920 


cttcactttc 


agcctgtgtt 


ttttcttaaa 


13980 


ctgttttttt 


ttttaaattc 


agccactctg 


14040 


aggttattat 


taataggtaa 


gtagttagta 


14100 


ttgtttttgt 


agatatttgt 


ttatttcttc 


14160 


tttatgaagt 


actaagcttt 


taattttttt 


14220 


gttcttatag 


ttctagaggc 


tgggaagtct 


14280 


gggtgatccc 


atggaaaaag 


atggaagtga 


14340 


caaatttatt 


ctttatcagg 


agtccactcc 


14400 


acatttatcc 


attcacgagg 


atgaaggtct 


14460 


ctcttaaaac 


tgttacaatg 


gcaattaagt 


14520 


atcatagcat 


ttcatccacg 


accctccaaa 


14580 


taattctgtt 


tcaatagcct 


caaaagtttt 


14640 


aagtccagag 


tcatcagaag 


cagatatgga 


14700 


aaatgtttcc 


agctgtgagc 


ctgtgaaatc 


14760 


ggtgagatag 


gtatagagta 


gactgtctca 


14820 


gcataactgg 


tttcaagtga 


gtccaaaatc 


14880 


ctggagaata 


atttcctttg 


actccatgtc 


14940 


cagacatgac 


cctgctccca 


tatctttgct 


15000 


ttggagttgc 


atgctagtgg 


ccttatagtt 


15060 


ggctccacta 


ggcatatccc 


taattgggac 


15120 


actcagcttt 


gccctaatag 


aggctcttag 


15180 


tattctttga 


tcctaggtgg 


agaaagccat 


15240 


gtacacatgg 


attccaccaa 


ggtttacagc 


15300 


atagtggcag 


tgcgagctga 


ggtcctaggc 


15360 


ggcacaggtt 


cactcaccca 


gttgtggctg 


15420 


aac t c taaaQ 


cacacctcaa 


taactcaaQc 


15480 


tctggggtca 


gtgcacaaca 


tcatcaaggc 


15540 


gagctctggg 


gagcaggtta 


ctgttgaaat 


15600 


caattccaac 


cctgggggat 


ggagtgccac 


15660 
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atagtgttaa ctctagaccc tgggatggta 
ctggatgtag cagcagcaag gcctcaggaa 

5 

ggtttcttgg agcaggtgca gaaatgaccc 
gactccaggg gacaggtcag ttccggggag 
10 gtggggtgac gtagctcaat caaagctatg 

ttgagtttgg aggggcaggg tgcaccagcg 
ggagcctggc ctcagggagt agagagtaga 

15 

gcgtgtgcat agtgtagtgg tggttaagtc 
ctcctgaagc agaatgcact ctagccagtg 
20 caactcaggc catgggggcc agggcacagt 

tgtggacttc aggtagctct gtcagctgag 
cagtagtaag actgcaggtg tctatggtgg 

25 

ccttttccct gtaggaagaa ggccctcctg 
gtgatggaag aaaggtgttt tctctccttc 
30 tacaagattt ctgctactcc tttgctgttt 

gaatgtagtt gtttattcat ttttggtgtg 
tgtgtgtgtg tgtgtgttga tggcgatgag 

35 

acattctcta gaattctgaa gacatattat 
taaactttct tctattgtga ccattttctt 
40 caattttttt ccagttgact aattctattt 

caaaattttt tctgtcaaca ttattttata 
catattgtgt cactttttgc actttaactt 

45 

taaattatcc atacttttta catattctat 
gtgggacttc ttggaggctt gtatcaagca 
50 taagacttat tcagagacta attgctttct 

tactactaac tttgatagtt ctcagaacta 
tgatgcattt aatttctagg attatgatca 

55 

cttatctttg ataagtataa tctgaaatct 
gtttctacat tttcccccaa atttatcctt 
60 cctgacttat agagatttta gttgtttccc 
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gggcatggca 


atatcccagg 


ctgtgtaagg 


15720 


tgactaggcc 


tgtctgtagc 


tttttccttg 


15780 


cctggggagt 


cagggtttct 


cagaagcttg 


15840 


gtggggcact 


gcagctattt 


tgcctggatg 


15900 


ggtgcccaga 


gagtggagca 


cagtttcaac 


15960 


actaagatgg 


gggcatggag 


cagttccatg 


16020 


tagcagcagt 


ttggctcagg 


tatggcttcc 


16080 


tcagagatgg 


aggggtgcaa 


tggctactca 


16140 


attcagttcc 


aagaaggggc 


agcacagtag 


16200 


gtcagctcct 


cctctgggat 


agaacagaca 


16260 


ctcagagcct 


gtgaagactc 


tatgtgtctc 


16320 


tgatgggagc 


tgctgggatc 


ctgtcactaa 


16380 


actcagagct 


ggttctaact 


gggggaaggg 


16440 


ttcctgcagc 


cattctgggt 


ttctgtgctc 


16500 


tctggagctc 


ttccttagtt 


attttggttg 


16560 


tgtgtgtgtg 


tgtgtgtgtg 


tgtgtgtgtg 


16620 


gttagtaact 


tctaattggc 


catcttgctg 


16680 


agacctttta 


attctaaact 


tctaatcttt 


16740 


gtttttctgt 


agaattcttg 


gtgatttctt 


16800 


tcagctattc 


cttgttattt 


aacacattgg 


16860 


tttgaaattt 


gtttgagaca 


atgttgcaaa 


16920 


ttcatttttg 


tgatttcatt 


ttcaattgaa 


16980 


atgtgatata 


tatccagatg 


ctcctaatct 


17040 


tatctacttc 


cttctttatg 


gaatccaatg 


17100 


aagcatgtct 


ttttgttggg 


atttctcctt 


17160 


taattctcca 


gggaagtgat 


aggtatcagc 


17220 


ataatttatg 


ctgtacacct 


caaggagact 


17280 


gtggccatag 


tgtcctgttt 


ctttgatact 


17340 


atgcaatctt 


tgtttttaac 


actcatttca 


17400 


actttcataa 


caaaatggag 


ttctaattct 


17460 
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gtttgtccaa 


attgtttttg 


gaacttttct 


agatgagata 


agaataagaa 


cattgacttt 


17520 


ttggttacat 


tcttacattt 


aagtccttat 


cttaactact 


tcaatcacat 


ttacattatt 


17580 


tatttcttat 


tattaattat 


atttatcttt 


cttttaaaaa 


tacttaactt 


cttattttat 


17640 


aatttgtaat 


tttgtacttt 


ttattttact 


gcttttcata 


attttgaatg 


tatacattac 


17700 


taaaattaat 


gctctaaatg 


attaatcttt 


tattttcatt 


ttaacttttc 


ccctctagct 


17760 


ttctctgaaa 


ttcaccctcc 


ttctcccaac 


tttaatcaat 


tttattgcta 


gatagctaat 


17820 


aatgtatttt 


cattgatcta 


tcagagataa 


atacacgatg 


acaattattt 


ccacagaaat 


17880 


ctgaattttt 


tctgtacata 


gcaaatttgg 


actacaataa 


cttttttttt 


catttatgtt 


17940 


accaataaaa 


gggcacacat 


ctccaaacct 


gacagttttt 


agcatatatg 


ctccactgaa 


18000 


atcaactatg 


acttgaagga 


tattagcaag 


cttcacttcc 


ctaaaatttt 


atggaagtgg 


18060 


gccatgaaac 


atctgaatat 


caattcaaat 


acataggtaa 


aaatttccat 


gtggctttag 


18120 


cctatctcca 


cattacttgg 


ttggaatcat 


tatactgagg 


agagacagag 


gcataaaagt 


18180 


aatatctttg 


tgcttggaat 


ttcaaaaaat 


ggctttatgt 


attcttgcta 


tgcaccagca 


18240 


agttagaatt 


taaagcactg 


taataggaaa 


agagatttaa 


atttgtggtg 


ccaagaatat 


18300 


gactcagagc 


tgtaagtttc 


attcccagag 


ttcagagctt 


atgggacaaa 


atacaaggtt 


18360 


acatgggaag 


attttcagcc 


ctgagaattt 


agctgtgctt 


tggttataaa 


aggaggaact 


18420 


atacaatact 


gtacaataaa 


ttggaaggca 


gatgaatttg 


tgtagcttat 


tactattata 


18480 


aaagtaatcc 


ttcatgggga 


agtggctgta 


atcaagagac 


taggcattaa 


aagggataga 


18540 


aataaaatct 


gagcagactt 


gggatttgaa 


gttgaagaaa 


tttctctaat 


ttacctcact 


18600 


tctgtcaaca 


tgtcctctct 


cctagcatat 


tattgatatc 


atataacatt 


taagtctctc 


18660 


tctctctctc 


tccacccccc 


acttaatgtg 


ttgcctcttc 


cacttcccta 


gctctaacaa 


18720 


ctattataat 


attggagggg 


aatgtagacc 


aggttttgtc 


cttagggagg 


aatctgcccc 


18780 


aaagtgtcaa 


gactttcttt 


tgtcatttcc 


aggacctcat 


cctaaattac 


atttcttatt 


18840 


gtagcacctt 


ttctgctgcc 


atgacaacca 


tgcaaggaat 


ggaacagacc 


actccagggc 


18900 


ctggccctga 


tgtgccccag 


ctgggaaaca 


tagatgtcat 


acattcatat 


ctgtgtaaag 


18960 


gattgcaaga 


gaagttcttc 


aagaggaaac 


ccaaagtcct 


tggggtaagt 


caattgcaaa 


19020 


tctgggagaa 


gaccaatagt 


gttgtaaact 


cacgtttgat 


cagagctaat 


gcccaataca 


19080 


c ]- cif~ c c c 1 a c Y~ 
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acatcaoaat 


ttaatttcaa 


19140 


gccctttctt 


tgaaaaaata 


tataatattc 


tatcacatcc 


tgcttccaaa 


ttaattatat 


19200 


ttaaaatata 


ttgacttcag 


atctattcta 


gggttgtggg 


tcagccatga 


taattatatt 


19260 


taacttcaat 


atagaagaac 


ggcaaaagaa 


aaggcccacc 


tgtccactag 


ctgactcccc 


19320 
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ctccacaggc 
tttcttagcc 
agtctcaggc 
gtgaaagaac 
tcactgtgga 
atatagttac 
aatgtatgga 
aacaattgag 
cttttcaatt 
gttgacacat 
gttctatggg 
atcccaccaa 
aaagtcatac 
ataacatgca 
gtactgaata 
aaggacatct 
gtgtgcaggt 
ttgtattgta 
ttgcaatttt 
ctagcatttg 
atctcattac 
tatgtttatt 
cattttttta 
tgaataacaa 
atcttttctt 
atgattaatt 
aaaaccaagg 
ttcaacattt 
gtctaagttc 
aagactattc 



cttcctgaca 
tactgctcag 
gaacgtgtct 
attaaatata 
ttctttggat 
ttctcataaa 
taattatttt 
agcaaagtac 
atcaacttcc 
cattactcaa 
ctttaacaaa 
tccctggcaa 
agttaacatt 
tttaaatttc 
atatttcatt 
tgatggcttc 
tttgtgtgga 
agagtatgtt 
gaagatttca 
atttcctcaa 
cattttaatt 
taccatctcc 
attgggatgt 
tcctttgtca 
ctgacagtgt 
cggtatttca 
tcatctagat 
gggtctgtga 
aatttttgcc 
ttctaattgt 



gctgatagat 
aggatgaaaa 
aaaaacaata 
ggtttatcac 
tccttctctt 
atatgttatt 
aaaatatact 
aggtatatcc 
cccaccagag 
agtacacaaa 
ggcatagtga 
tcactaatct 
ataaattagg 
ctctatgttt 
gtctagaggt 
caaactttgg 
cataagtttt 
tagttttgta 
gtcaacaatg 
tgttctggat 
ttcatttccc 
atatcttctt 
tagtattctt 
ggtatgtttt 
cttctgcaga 
tgggttgtgc 
tttggcctat 
tttattttga 
tatggaggtc 
attatcttag 



gtatattgct 
atctgctatt 
tagaattaaa 
tagcccaagc 
tttttgcatt 
tatgttggca 
gtatgtttta 
catatgcccc 
tgggacattt 
tcacaatggg 
tatgaatcta 
tctatcgtct 
tagtttttaa 
tcttatggct 
accacagttt 
caattaggag 
caactcattc 
agaaactgcc 
aaggagagtt 
tttttaccat 
tggtgacata 
tcttgaggtg 
aatgttgagt 
ttgcaaatac 
gcagaatttt 
ctttggtatt 
gtcatctgtt 
gttaattttt 
cagtttttta 
ttctttatca 



actctttctt 
gtctgagcat 
acagtgacag 
ttcaaaccac 
attcattttt 
catactaggt 
gagcagtttt 
ttgccccgac 
gttccagttg 
gttcattctc 
cctttatagt 
tcatagtttg 
agactgcctt 
taatagctca 
atttatgcat 
taacgttgct 
gggtaagtat 
aaactgtctt 
cctgctgttc 
tctaataggt 
agatgtgggg 
tcttattaaa 
ttgaatttcg 
cttctgtcag 
ttaattttaa 
gtattgacaa 
aagagtttca 
gtgaaggata 
agtagtattt 
aagatcactt 



tcttgcagct 
ttccttctgg 
gttagggaag 
tgagaaaatg 
ttggagagaa 
tcaattttta 
aggcttaaag 
acatgcatag 
atgaacctat 
tgtgttgtgt 
atcatacagt 
ccttttccag 
ccttcaccta 
tttattttta 
tcacctacag 
ataaatatct 
gattactata 
ccctgttact 
tacataattg 
gtgtagtcca 
catcttttta 
gtttttgaca 
ttgtatattt 
tctgtggttt 
taaaattaag 
actcatcacc 
gagttttgag 
caaggtatgt 
tttttttgga 
gactatactt 



19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
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ctgtgtctat ttctaggctt tctattctat 
aatatcacac agtcctgatt actatcactt 
5 ctcctaaaac tccttcttct tcaagattgt 

tataaacttt agagtcagtt tatcaatata 
attttgagat ggagtctggc gctgtcaccc 

10 

cactgcaagc tctgcctccc gggttcacac 
gggactacag gcgcctgcca ctatgcctgg 
15 gtttcaccat gttagccagg atggtctcaa 

cctcccaaag tgctgggatt acaggggtga 
tgagggagat tgcattaaat ctacagatta 

20 

tgagtcttcc tatccatgga catggaatat 
ttttttaagt ttctcttata gattttgtac 
25 ccaaatacca atacctaaat atttaatttt 

tttaatttca aatttcactt atttactata 
attaagcttg tatcctacaa ccttaatata 

30 

gattctttca gatattctac acagacaatc 
tttttcctta ccaattgtat acctttattt 
35 cttccagtaa atggtaaaag gagtagtgac 

gtgagaaagc tttgaatttc tccattaagt 
attctttatc aacctaaaca agttatcctc 

40 

aaacaggcat tggattttgc tcaatatttt 
tttcttcctt ttagcctgtt gatgtcatta 
45 cagccttgca tatctgggat aaatcccata 

tattagattt agtttgcaca tgttttgttg 
gttggtctgc agttttcttt tcttgtaatg 

50 

tgctagctcc atagaatgag ttagggagta 
tggtataatt tcattttcaa gtgtttggta 
55 tctgttttga aaattaatta attatttatt 

gagtgcctat ttcttcttgt gtgaattttg 
ctcaaggtta tcaaattgtg ggcacagaat 

60 

attactatgg gaactgtagt gatattcccc 
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tcccttgatc 


tatttgtcta 


tttctttgcc 


21180 


tatagtttgt 


cctaaagatg 


ggtagtgtta 


21240 


ttttgttatt 


ctaagtcttt 


tgcctttccc 


21300 


cacaaataat 


ttgctaatat 


atatatatat 


21360 


aggctggagt 


gcagtggccc 


aatctcagct 


21420 


cattctcctg 


cctcagcctc 


tggagtagct 


21480 


ctaacttttt 


gtatttttag 


tagagatggg 


21540 


tctcctgacc 


tcgtgatccg 

• 


cccgcttcgg 


21600 


gccaccacgc 


ccggccattt 


gctaatattt 


21660 


aattgggaag 


aactgatgtc 


ttgacaatgt 


21720 


ctctccattt 


atttagttat 


tcgacttatt 


21780 


atagtttgtt 


agatttatgc 


ctaacaaaca 


21840 


tagggttgct 


aatttaaata 


gtattgcatt 


21900 


agtacataag 


aaagtaattg 


actcttgtgt 


21960 


atcatttgtt 


aattccagga 


gtttttggtt 


22020 


acgtcatctg 


tgaataaaag 


cagttttatt 


22080 


ccttttcttg 


tcttaccatt 


tcagctatga 


22140 


aggcttatcc 


ctgccttgtt 


cctgatctta 


22200 


atgaatttag 


ctgtagtttt 


attaataaat 


22260 


tattaatagt 


ttttcaaaag 


tttttataat 


22320 


tcctgcatct 


gttgatataa 


tcatgtgatt 


22380 


gttacattca 


ttgatttttg 


aatgtctagt 


22440 


tggttatggt 


atataattct 


tttgatacac 


22500 


ggaatttttg 


catctttgtt 


caccagagat 


22560 


tttttgactt 


ttttttttgg 


tagataatag 


22620 


tttcctcttt 


ttctgttttc 


tgtaaagaat 


22680 


gaattccaaa 


cctttccagg 


cctggtgctt 


22740 


tt at t tat tt 


aataattata 


acre t tat tea 


22800 

ted ted \* \S 


acaaattgtc 


tttaaggagc 


aattcattgg 


22860 


tgttcgtaga 


gttattttat 


tatcctttta 


22920 


ctttcactta 


tgagtaatct 


gtgtcctctc 


22980 



-159- 



WO 02/062946 





tctctttctc 


ttagactggc 


taaaggctta 

> 


5 


taaaaaaaag 


cattggattt 


cattgagttt 


atttctgctc 


taatttctat 


tacttttttt 




^ U U U U V_- 


aatttcttaa 


aataaactct 


10 


atatattcat 


tcaatgttat 


aaattttttc 




tttgataagt 


tgttttaatt 


ttttgaccca 


15 


gctcctattt 


taagaacgct 


gtttactcat 


• 


aatatcagat 


ttgctcttct 


ttttttcttc 
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rtacaat aat 


tactttctaa 


20 


caaaagagag 


aaacagttgc 


ggtagaagta 




gctggttctg 


gcatactcct 


tttaaccgtg 


25 


gataatattg 


gtaagataat 


attcatggtt 




tatcaaggtc 


atatgactat 


atcaaatgaa 




yyy 01 y L yy Lci 
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30 


ggttgtgcgg 


attctgattg 


ccttgatgag 




tgcatttagt 


tcttatgaag 


aacatcccat 


35 


gtcagtgatg 


gtaagcaggg 


tattttttga 




aataccctga 


tctttgcaat 


tttattctaa 




y y 


\m# Nw' l*» ^ ^ CL GL w*. 


ctcaattttc 


40 


acttaaagct 


aggcgattaa 


atactttata 




attgccaaag 


acccaaactg 


gaatgactta 


45 


agattagggg 


cttatcattt 


ccagaattgt 




gactgctagc 


agactatacc 


agaggggaga 
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tcaactatca 
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50 


tagactgatt 


aagatttacc 


tctgtagctg 




aaaaactgaa 


gaaaatgatg 


gtttagttag 


55 


acagtttcta 


atatagagtc 


tactgactat 




ttaaattgtt 


atttgataca 


ttttgtttca 




ttaaattaaa 


attagaattg 


gaatttaagt 


60 


gagagttttg 


atgcaaattg 


aaagtaaaat 
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ttgattttat 


ttatgtattt 


ttcttttttt 


23040 


gtctattaat 


tttctgtttt 


taattttatg 


23100 


tcttccgctt 


acttggagtt 


taatttgett 


23160 


caggtgattg 


attttgatca 


ttattttcta 


23220 


taagcactgc 


tttcactaca 


tcccataaat 


23280 


tttataatgc 


catgagataa 


ttgtactcca 


23340 


gttcatgtcc 


tcaggcccag 


aegtcaagaa 


23400 


ccttgtatac 


actccttttt 


cttcaggtta 


23460 


ttccttggcc 


acttttctct 


gagtgtgtga 


23520 


gcatcctagg 


tatcagaaaa 


gcacctctgg 


23580 


aaatgattga 


cagtggtaga 


gatgetgaga 


23640 


ctttatgtga 


aaaaaagagt 


gacaacagaa 


23700 


aaagaataag 


gatagcttag 


tgattggaaa 


23760 


gataatcagt 


ttttgettte 


atcatttcta 


23820 


ccttagcatg 


ggaataataa 


tgatgtgtgt 


23880 


ttttgtgtat 


gttgegtaca 


caatttgggg 


23940 


tataattgga 


tataacataa 


tagaggtgta 


24000 


ttctatcttg 


taattctgta 


aatttgagaa 


24060 


tcatatttaa 


aatagtaata 


ataatataaa 


24120 


taagtcagtt 


aggatatatt 


acagcaacac 


24180 


agcaatgaaa 


gcactgattg 


tttcacaaca 


24240 


ttaatttggt 


ggtgtaagga 


tttcatcaag 


24300 


tggtagacac 


atctcatttt 


tgataaatat 


24360 


gtaagggtaa 


tggacttgee 


attactggct 


24420 


ggaggagtct 


acactacctt 


ttagctcagg 


24480 


caaggaaaaa 


gagatgtggc 


tggtaaacag 


24540 


caaatctcag 


gaaattgtta 


ccttagaaaa 


24600 


taagtttaat 


aacagtgact 


atgattaata 


24660 


ttaaaaacct 


tttcccaaga 


gaaaacacaa 


24720 


tgagaataaa 


ataccaagtc 


ttacctattg 


24780 
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acaaaagaaa aataaaattt atgaggcggg 
cattataggg tttccattta tttccaaaac 
5 agagcttaat ttttgctccc cttcactcgt 

gtgtattagt cagggttctc tagagggaca 
aaaggaagtt tattagggag tattgagtca 

10 

ccatctgtaa gctgaggggc aatgaagcca 
tctgatgttt gagggtagga aacatgcagc 
15 ccagtctagt gtttctatgt tcttctgcct 

tagattgtgc tctcccagat tgagggtggg 
ttaatctccc ttggcaaccc cttcatagtc 

20 

atccaatcaa gttaacattc aatattaacc 
ccacacacat ctcctgaagt catacatgat 
25 caaaattatg cctaacataa tgcagctatt 

tatacaaatg ctattaccta aagttaacaa 
cttatgtcac atggtaaaga aaaaagaaag 

30 

aatgtataca tgcacaaaca tgttcttaac 
cggtcttcat ttctgcaact gatcacatgg 
35 taccaccgat tctgtattcc ctttgccttc 

ttttcctggt ggagtgacca aaaccttcat 
tcctgcccag attgggttgt tgtagtttcc 

40 

ctaagagaca ccctaaggga tctcctgtat 
agtagtagac taattttatc ttgatagtcc 
45 cccttcttag ctggttgact tagaggtaag 

aacttccagc ttaatgaaat cattgttgtc 
taagacctct aggccagcag aacatagtgt 

50 

taggtcacta ggggtgatgg taagtggtgc 
cccacaaatc ttggctatag cagaaacagt 
55 cacagccttc tggagaacat tgccccagcc 

taattgtgac ttcaaaagga cattccacca 
ggaaacatgg taagaccagt gaattccatg 

60 

caacaagtga gtgccttggt cagaggcaat 



PCT/US01/48437 



tttatgttag 


aatcaaggaa 


tctattctag 


24840 


tgagaatgag 


acagtttaga 


gttttataat 


24900 


gtacagcgga 


agtgaaagat 


ttgaaggaga 


24960 


taactaatag 


gatagatgtg 


tatatatata 


25020 


cacaatcaca 


aggagaggtc 


ccacaatagg 


25080 


gtttgagtcc 


caaagctgaa 


gaacttgaaa 


25140 


acgagagaaa 


gatgtaggcc 


agagactaaa 


25200 


acttttattc 


tggccacact 


ggcagctgag 


25260 


tctgcctttc 


ccagtccact 


gactcaaatg 


25320 


acatccagga 


atgatatttt 


gcatccttca 


25380 


atcaagagtc 


caccccttgt 


caacctgaac 


25440 


tttcaaataa 


agaaaataaa 


gacaaaaggc 


25500 


ttttgtacaa 


cccaaaatgc 


accaatcccc 


25560 


cacttaaatg 


ctaatatgaa 


gttaataagt 


25620 


aaaataaaat 


gaagatattt 


tcttagtaca 


25680 


aaaagaagga 


ggaaatactc 


atgacaatta 


25740 


ttgtagctgg 


tattgatgac 


agccttattc 


25800 


aggaagcacc 


tcagcaggtt 


gtgttttttt 


25860 


tccagaaggc 


ctagtctggg 


ccatttgtag 


25920 


cattgacctt 


aatcataggg 


catggtaata 


25980 


tccacacata 


cttccttacc 


tccattgtgg 


26040 


aggtcagtca 


ccctagccaa 


cactgtaact 


26100 


aggagcccaa 


aatggccagg 


tggcaatctt 


26160 


tcccggtggc 


aacgttcctt 


cctctggaac 


26220 


tgtggaaaca 


ggaagcaaaa 


attttgctgg 


26280 


cacttccact 


tatacccctt 


gatttctgga 


26340 


accttatatt 


agacgctgat 


tcagagcatg 


26400 


c taaaaaat a 


t taticacc tcr 


Qt tciatcrtta 


2 6460 


ttctatcaat 


ccagctgctt 


caggatgatg 


26520 


agaatgagcc 


cactgccgca 


tttctttagc 


26580 


gctatgtggg 


ataccatgac 


agtggttaag 


26640 
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gcattccatg agttcttgga ttgtagtctt 
cccatatcca aagtaagtgt ctattctggt 

5 

aagaggtcta atataatcaa cttgccacca 
gctatattga gggctcagtg ctggtctttg 
10 catagccagg tcagtgaatg gaagtccatg 

gccaccatgg ccacttcgtt catggactca 
aggctgactg gtgtccatag aatgagtcat 

15 

tgctgaggtc accgtttgat aagcactcac 
ccactcggag aggtccattc acatacctct 
20 atcatgcttc ttccaagtcc ctgactactc 

agtatataat tgcagatctg gccatttctc 
tgctcaacat tctgcccact gggacgattt 

25 

aggaaggggc tgtagtgctg cagctgacca 
gccatctgta aacagggccc ttgtcttctc 
30 atgagaccat cagtgcaggc tgggggagag 

gcatttgagc cacttcctca tataacttac 
cacatatatg ccacttccat ttgatgatgg 

35 

tagacaagtc agaaagcact caattcatga 
gacccatagc caaccgttta gtttctacca 
40 aaaaaggaaa gtagttatct gcagaagatg 

tctgctgtga ttcatctatg ggggacttcc 
ctgacacctc aagcacaatt agatctgctc 

45 

gcatagcagt ttggacctgt tgcagagcct 
aagcctttca ggtctcttga ttaatgggcc 
50 cctccaaaac ccaaataagc ccactaggca 

gggcaaatac agcaacttat ccttcacctt 
ctgaacccct agaaatttta ctgaggtaga 

55 

ccatcctctg gcatgcaaat gtcttaccaa 
actggatccc atcagtataa tggcatcaat 
60 ggaaaagtga tcaagatctc cctgaacaag 



PCT/US01/48437 



gccagaagca 


ttgtgtgcag 


gataggcaaa 


26700 


gaggaaaaac 


cgctgccctt 


tccgtgatgg 


26760 


agtagctggc 


tgatcattcc 


gaggaatggt 


26820 


ctgctggcaa 


attaggtgct 


cagcagtggc 


26880 


ttgctgagcc 


catgcatatc 


ctccatccct 


26940 


ttgggcaatg 


acaggggtgg 


ctggagaaag 


27000 


cctgtttact 


tgattattaa 


aatcctcctc 


27060 


atgaaataca 


aatatcttca 


cagtttttga 


27120 


tcccaaaatt 


tatttgtcac 


caattttcct 


27180 


agccaaacca 


ttggctatag 


cccatgaatc 


27240 


cttccaagca 


aagtgcacaa 


tgaggtgcag 


27300 


tccttcaccg 


ctgtccttca 


gggatgtcct 


27360 


ctttgaggtg 


gtgcctgcat 


atcgtgcaga 


27420 


ttcctgttaa 


ttgatcctgg 


ggaactccct 


27480 


aaggcagtgt 


ggcgggaatg 


gggatcatga 


27540 


ttgtgccctc 


aggagttgct 


caagcctgat 


27600 


aatgctgctg 


tgcatgcccc 


actttatggc 


27660 


taggtagttc 


aggtcgcatg 


gtgacttgat 


27720 


aagtccagta 


acagggcaag 


agctctttct 


27780 


gcagggcctt 


cctccaaaat 


cctagaggcc 


27840 


aaaggctcct 


aacagcatcc 


ctctctgctg 


27900 


tgttctatgg 


cccaagtgac 


agagcaactt 


27960 


tctcctgtcc 


tggaccccac 


tcaaaactgg 


28020 


aagtaacacc 


cccaggtaac 


gaatgtgtgg 


28080 


ttgtgagtct 


ttcttggttg 


tagggggtgg 


28140 


agaaggaaca 


tcttgacagg 


ccccacacca 


28200 


aggtccctga 


attttagtca 


tatttatttc 


28260 


taagtccagt 


gtgcttgcta 


cttcttgctc 


28320 


ataatggacc 


agtgtgatat 


tttgtggaag 


28380 


gtcatgacac 


aaagccagag 


gagagttgat 


28440 
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atacccctga ggtaggacag taaaggtaca 
cttctggtgg gccttataga caggaatgga 
5 ttaccaggta ccagaagatg ggttaatttg 

aactgcaact ggagtcacta cttggttaag 
tccatctgtc ttctgcactg gccaaatggg 

10 

cacccttgca tcttttaagt ccttgatggt 
gggatattgt tgttgattta ctgcttttct 
15 ggccttttcc accatagtag ccttcactct 

cagctgctaa gcatgtctgt gtcaattatg 
tgagtgtggg gacccactgg agccactgta 

20 

acatgacctc cataagcccc tactataact 
ggaatcaatg tcagctcaga gtcagtgtcc 
25 ctttctttcc ccaatccagt tacctggtaa 

gagaaagatt aacaacataa attgtcagta 
ggcctccttt tcattcaagg ggttctgggt 

30 

agtggggctg cgattctctg tttttataat 
aagttttctg cttatgtaaa ttaagtaaga 
35 taggaacgct gtaattaatt agccaatacc 

tgctgctttg cctcttctgt ctattatggt 
gtgctgccat ttggcccctg ctgtctctag 

40 

gtagttgagt gactacagtt tctactgtaa 
agctcttcaa ggatgcaagt gctgccctct 
45 gtatatcttc tgaaccctcc cagctgggat 

tcatcacaat ctgcctaagc ctttggatcc 
cattttcagc tcacccacag tgggccatct 

50 

taaactagaa cttttttaac tctgcaagct 
gggcccaaat caataaattc agcctgatcc 
55 cccttaatat tcattcccat gcctgttttc 

caagcagttc ttttcaagtg tagcacactt 
taggggcctg ctgggacttt agttataggt 

60 

tcctaaggag aatcaacatt gtcttgcttg 
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ttgctggtct 


tgccagctga 


aggtaaattg 


28500 


gaaaaaggca 


tttaccaaat 


caatggctgc 


28560 


ctcaagcaat 


gaaaccacat 


ctggtacagc 


28620 


ctcatgataa 


tccactgtca 


ttctccaaga 


28680 


agagttgaat 


ggggatgtgg 


tgggaatcat 


28740 


ggcactaatg 


tctgcaatcc 


cttcagggat 


28800 


aggtagaggc 


agctctaatg 


gcttccattt 


28860 


atgagtcagg 


ggccagtgtg 


ggggtcctga 


28920 


cattctggca 


ctggggaaat 


gaatgcagga 


28980 


agttgaacct 


gagctaaagc 


tccattaatt 


29040 


ggaggatcac 


aatgactttt 


tgggtcccct 


29100 


agtagttccc 


aaaaggtctg 


atcatttttt 


29160 


aaggccagag 


gtctccttga 


ggaaggatgg 


29220 


gtatagtggg 


atccttcttt 


aagaggacct 


29280 


ctgtaaaatg 


gcatatgtct 


ggaaattgat 


29340 


tcaaattagt 


gttttgtcca 


ctcgacctgg 


29400 


attcggtaag 


cttcctgtca 


atttcacttc 


29460 


agagctctac 


acaagtcaga 


ctattctgat 


29520 


agctatgcct 


acctttcctt 


tgatggctga 


29580 


atccaattat 


ttttattgca 


tttaaatttt 


29640 


gatctgacat 


acagagaaag 


gcaatcacag 


29700 


caaatctatt 


tcacaaagta 


ctggtcaagg 


29760 


gaataggctt 


aaagtgacta 


atccactcct 


29820 


cttcctctac 


attaaaccaa 


ggagatcagg 


29880 


tttaatccat 


atttcagcta 


accaagcaaa 


29940 


gcaacattaa 


atgcagaatc 


cctgcttaga 


30000 


aactctatgt 


tcttccacca 


ttatcccaca 


30060 


cagatt t ctg 


ct tatataaa 


ttagaaaact 


30120 


tcttgtgtat 


cgcattctga 


acctcacctc 


30180 


ctagaagcaa 


acaggggcct 


tgggagtggg 


30240 


gcaacacctg 


aggagaaggc 


catcactatc 


30300 
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ggggaacctg 
ttgagaaata 
aagacatcac 
ttattaggga 
gtactgtaca 
cacagtaccg 
acatcttatc 
ggaatttcct 
ctacagcctc 
caggcgcgta 
tctcccattt 
tcagaattta 
caaggtgccc 
caattattac 
aaagtaaaga 
tgaaatcttt 
agtattaatt 
catggcttca 
cattaggcag 
gttagggagg 
aggctcctga 
cccattccaa 
cacctggtga 
ttgtgtctga 
ggcaatctta 
gagttcataa 
aactccatcc 
tcagggttct 
ctaataggat 
tagtctttcc 



ccccgatagt 
aagggacaga 
atgtcgatag 
gtttcaaaaa 
aggtaataga 
gggtgaaatt 
aggagacagg 
cttcctaata 
gaccataaga 
ttctctttcc 
gcttttgaaa 
aggttctctc 
agatttcata 
agggtcctga 
cgggcatagg 
acaatttatg 
tggggaactc 
gccggtccct 
cgcagggttt 
ggatgttgcc 
gagtttacaa 
gttgcagggt 
gtctatgcat 
gtgtccacaa 
gaagatttga 
ttttctttca 
ttaatattct 
ctagaggaat 
agatgtatac 
atgtttttct 



cacgtaggtt 
gtacacaaaa 
gttccgtgac 
gggagggagt 
atatcacaag 
aaaattgcta 
gttttgagag 
agcctgggag 
gaccggcaca 
cagggatgtt 
gaagagaaat 
tcttattccc 
ttgtttaaac 
ggcgacttac 
aaatcacaag 
tttagagatt 
ataaatgtcc 
ccgtttgggg 
atctccttag 
actactggtg 
gctcagtgtc 
cccattcttt 
gcaccttttg 
tttcagctct 
ggctcagtat 
tcactttgtc 
gtttttctag 
agaactaata 
ataaagggca 
gcctcctttt 



cttttctgtt 
gagagaaatt 
accccacaag 
atgcgaatag 
gcaagtggag 
atgaagtttg 
caactggtct 
cgctatggga 
tctggggggc 
ccttgctgag 
atggctctgt 
tgaataattg 
acacatgctg 
atcttcctca 
ggtattgatt 
gcagtaaagg 
atgaaatctt 
tccctgactt 
acaaaggtga 
atggggaagc 
cccagcttca 
ttcaatcaat 
ttgcaggtca 
ttctctacag 
ctgcttctga 
cagtgaactt 
atcatctcct 
ggatagtcag 
gtttattaag 
attctggcta 



ttccctaagc 
ttaaagctgg 
ctgcaaaaac 
gtgtgggtca 
gcagggcgag 
aggcaccatg 
gaccaaaaat 
gactggggtc 
catttatagg 
aaaaagaatt 
tctgcccggc 
ctgttatcct 
tacaatttgt 
gctgacagga 
gaggaagtga 
caggcataag 
cacaatccac 
cccacaacac 
aaaggctgat 
tgttttttct 
tcaaggttct 
gctctcgctt 
gcaaatcgca 
aagataagac 
agcttagagt 
aggagcaaaa 
ggtaccaaaa 
ggttctctag 
accagaagac 
tgctggcagc 



gttggctggc 
gcatccaggg 
cagtaagttt 
cagacatcaa 
atcacaggac 
gtcattgata 
ttattaggcg 
tatttcaccc 
cctatacccc 
cagcaatctt 
tcaccggtgg 
gttctttttt 
gcagttaatg 
ttaagagatg 
taagtgtcca 
aaattataaa 
attcttctgc 
atcactgttg 
ggcagtgtgg 
ggcaaaaaaa 
accgcacatc 
taacagtaga 
tgataaaagc 
tctcactcag 
tagaatccct 
ccaaaaaaag 
tctgtattag 
agggatagaa 
taaatgagtc 
tgattgactg 



30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
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aggatgggtc 


tccctttcct 


agtccaccaa 


ctcaaatgtt 


aatctccttt 


gtcaacaccc 


32160 


tcacagacac 


acccaggaac 


aatactttgc 


atccttcaat 


tcaatcaagt 


tgacactcaa 


32220 


tattaaccat 


aacagacagc 


aaacactgtt 


ccatctctgg 


aaagttgcaa 


tatatatata 


32280 


tacatatgat 


atataaccta 


tatatataat 


atatatatta 


ccaatgtcaa 


taatagcttt 


32340 


ttattaaact 


cttactatga 


tctggacatt 


ctgctaagtg 


ctttttatgt 


gtacttttta 


32400 


tttcacatca 


tcaccacaaa 


cttttgaggg 


aggatttctc 


attatcttca 


ttttatggat 


32460 


gaagagaggt 


taagtaactt 


gctcaagatc 


acaactgttt 


ggtacaggat 


cagggttcaa 


32520 


atagggtcac 


tctattatca 


gaatcacata 


catgtaacta 


cctaggtttc 


cgctttcatg 


32580 


tccattttct 


tttcaagatc 


acattgaaaa 


ccagcttaca 


tggttttagc 


cttccacacg 


32640 


tgttcagtag 


attaccaaga 


actgcaggag 


ctgtgtttgt 


ccactgagtc 


atcactcaga 


32700 


ctcttgtagg 


cattgaacta 


gattggtaaa 


tgcagtaaat 


tggtaaaacc 


cttttttgct 


32760 


gctaccttcc 


attattcttt 


caaccttttt 


ttccgaacat 


cacctgcaaa 


gtgcctctct 


32820 


tttctccctt 


ttacagtata 


agtaccagta 


ccaagagcca 


gctgtaagtt 


tgtattatcc 


32880 


ttggcctctt 


cagttctatt 


cctatttatg 


tttcacattc 


tttgttaagt 


ttttcctctg 


32940 


agataccata 


tacatatttt 


ttcctgtggt 


aaatgaaatc 


ccttttttca 


atcctatctt 


33000 


caaatacaca 


aacatacata 


cacacttcta 


atatgtttat 


ctaatttctg 


tatataacat 


33060 


ttttccaagg 


tgctttactg 


aattttttaa 


agagtttatg 


gtacttttta 


gtttatattc 


33120 


ttggatgttt 


gcaatatatg 


ttattcatac 


ttaaataaca 


gtagttttag 


ccatttgttt 


33180 


caactatttt 


tcttatataa 


ttattttggt 


taatataccc 


caaatagtaa 


tacataatag 


33240 


tctacatgtt 


tatcttgatt 


ttgactttaa 


taagatggtc 


ctactttttt 


tttaatcatt 


33300 


aaacatgata 


aagacttgta 


gactgagaca 


gatgtcctat 


cttgttacag 


aaagtataat 


33360 


tatttatatt 


ttaccaataa 


tgtttaagtc 


actaataatg 


ttggattgta 


ttaaatgtcc 


33420 


tatcctatca 


acactgtaaa 


catgagtagc 


aattaacagg 


taacgttttt 


cttaaattta 


33480 


tttattatat 


atattagaat 


atttcaatat 


taaaaatctt 


tgtagagttg 


aaggaacatc 


33540 


ccaaatgttc 


atgatgtatt 


ttatgttaca 


tgatggatga 


tttgctctac 


tgttattttt 


33600 


tttccaatgg 


tgatttagat 


aacatgaatc 


atctctatgt 


ctgggtattt 


ggaaactaaa 


33660 


gaagtttatt 


gagctatttt 


attagtccag 


tttctttata 


ttaggatata 


atttttgaat 


33720 


catctQaQat 


caaatttqta 


aaggaaataa 


aaaactctaa 


taatggaata 


ttataaggta 


33780 


gagagtccaa 


ccaacttgat 


tagctatttt 


ggctaaatga 


aaaataattg 


ccccattgat 


33840 


ggattcagat 


gaagtagtat 


attctagtta 


tcaaataact 


gggaactaaa 


ttttatgtgc 


33900 


tactatttta 


ttttataact 


ttttaaacat 


attttcaatg 


agaattattt 


tagtacattt 


33960 
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tttgtttatt 


gtattcaata 


ttaataccat 


ataagaatta 


cttcttatca 


ggaatttgga 


34020 


5 


agctttatct 


ttcagtgtcc 


tcaaacagtt 


ttaatatcat 


tccagttagc 


tattatttga 


34080 


aaggcttgat 


aaattatccg 


gtggactcat 


ctggcagaat 


ttttctgttc 


tttctgggaa 


34140 




aagagaaaat 


tatgtgcaat 


ctcttgcttt 


ctaagataat 


tgacctgttt 


atacatatct 


34200 


10 


atctaatttg 


gaataaagtt 


tactaagttg 


tgatttccaa 


tacttgttta 


aaacttttta 


34260 




tctccatttt 


tttctctcat 


tcttatttta 


ttatttcagt 


atccttatca 


gttgcagcag 


34320 


15 


gaattagaac 


aacaaaaggt 


ctggtgagta 


atgttttgtg 


gggggtatca 


aaaggaaagg 


34380 


attaatcaat 


aatgtattct 


gccagggtat 


aaaacaggtt 


ttgtttgttt 


gtttttaaaa 


34440 




aacrt cacfcracr 


caactatQca 


tt t tac ttaQ 


caqaatctQt 


c tcaaaaat t 


acaataaqqc 


34500 


20 


taatgataga 


attactgaca 


taataatgga 


tattttgtca 


aggtagttga 


tgagggaatc 


34560 




tttatgaaga 


tatggaaaat 


tatttagaac 


aaatatgaag 


tgggatcact 


caaaaaagct 


34620 


25 


ctgcagcttg 


ggattcagcc 


catgacagtc 


acaccaagta 


ttggtgtggc 


ataatgttta 


34680 




ttgaccaatt 


gattaaaagg 


tagattctgg 


ccgggcgcgg 


tggctcatgc 


ctgtaatcct 


34740 




aacrac t ttcxcr 


aaaacctaaa 

;3 ;3 ;3 ^ ^ z3 


taacrtcracrt c 


at t tcraacxtc 


aQQaat t tcrc 


aaccaocctQ 


34800 


30 


gccaacatgg 


tgaaacccct 


tctctactaa 


aaatacaaaa 


aaattaccag 


gcgtggtgac 


34860 




gggcgcctgt 


aatcccagct 


actcaggagg 


ctgaggcagg 


agaattgctt 


gaaccctgga 


34920 


35 


ggcggaggtt 


gcagtgagct 


gagattgtgc 


cactgcactt 


cagcctggtg 


acagagcaag 


34980 




actccatctc 


caaaagggaa 


aaaaaaaagt 


agattccaac 


tgtatccaaa 


aaggattttg 


35040 




aattttcaat 


ataaaaaat t 


cacaat aaaa 


t tat t acata 


aaaaQcacrca 


aat ttaaaat 


35100 


40 


gtaactctca 


tgtcaagatg 


cctttcataa 


agccaataaa 


atgcaaaagc 


caagatctca 


35160 




acaagggttt 


ttctggctaa 


acactatttg 


agttttcttt 


gttcaaaaga 


taaatatcaa 


35220 


45 


actctggagc 


atgatcaata 


tcggttttcc 


aaatctggcc 


taacccaact 


tcctagtgtc 


35280 




tgccgggacc 


tgcagccaca 


ctacactctt 


cacactttcc 


tgtgtaattt 


taatacttat 


35340 




aaaaaacctc 


c t tacc t t ac 


ccatcracraacr 


acraaaataca 


aaaacrcjaat a 


cacaaccaaa 


35400 


50 


aaaaataatt 


taaaatgtag 


agaggcatga 


gaggcaagca 


gtgtttagaa 


caatgataag 


35460 




agatcgggca 


ccagaagcaa 


ggaatggcag 


ggaaggacac 


agacatgact 


gtccagctaa 


35520 


55 


tgtgtaaaaa 


atcattgaaa 


accctgataa 


atgatttgga 


gattatctca 


tcttctcaca 


35580 




ggaattatct 


gaagaattta 


agctggagaa 


taatgggatc 


atatttgtgt 


tttggcgaga 


35640 




gatcagagtt 


gaaaccacta 


taaggagaca 


agttagaaga 


ctattgaaat 


aagataaagg 


35700 


60 


tagacagcag 


tgtaaagtaa 


ttaacagaga 


tatatctgag 


gatgatgttg 


gattgcttaa 


35760 
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ttaattttca tttctttcta ggttggaggt 
ttggctatat cagggatctt aatcaatgca 
5 cattactgta accacgatca gttgtcaagt 

gtgagtgttt tctcttttca tgagatatta 
ctggcaacaa caataattat catttcctaa 

10 

ttgagaaata gaataatctt tcatttagga 
ttttcctcta tatgtgatga ggcaaaatgg 
15 tccttaatag ttactgggct gggcattttg 

tgatgcagtg tctactgtta tctcctccag 
gtcccagccg gacacggtgg ctcacgcctg 

20 

gcggatcatg aggtcaggag atcgagacca 
actaaaaata caaaaaatta gccaggcatg 
25 ggaggctgag gcaggagaat ggcgtgaacc 

agtgccactg cagtctggcc tgggcgaaag 
aaagatgtca gtgcctattc tcaaataata 

30 

agtggtaact cagtatcctt tcccacatct 
atcctcaaca tgtggcacct gaaaaattct 
35 ttgtaccagt tgtccttatt tcaatgctat 

tacacaacat tttgtaagta ataaaagttc 
gggaagagtt cttgcagcac atcacagcag 

40 

attattatta ttattgttgg agatggagtt 
gcagttcaca atgcacgtgg tggcacaatc 
45 tcaagtgatt ctcctgcctc agcatcccaa 

gcctggctaa tttttgtatt tttagtggag 
cttgaactcc tgacctcagg tgatccaccc 

50 

ggtgtgagcc actgtgccca gccaaatctt 
acaccaccaa actttcagtt acttttcctc 
55 cttagttctt ttcctcactg tgcccttccc 

ctaacttact agacagaatc aactgcattc 
agtcttgagg gctacgtagt cacaccaaaa 

60 

tttacttccc ttgagatatc taatgctttt 
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agtctaggaa 


agaatatcac 


cagttcagtc 


35820 


ataagcttga 


cgttttattc 


attccgttac 


35880 


aattgttaca 


tgactatgtc 


cattttaatg 


35940 


tcatagaagc 


agtttagggg 


agtgctggct 


36000 


ttgctgaatg 


actttggaag 


ttgtaagtga 


36060 


gtcaggtaca 


catctcatct 


ctgtctttat 


36120 


gaagaaagct 


tatagttata 


gaacacctac 


36180 


cataaatttt 


tctgtttaat 


tttaaaacat 


36240 


acccctctaa 


aaaatggcgt 


caaaaagtca 


36300 


taatcccagc 


actttgggag 


gccgaggcag 


36360 


• 

tcctggctaa 


catggtgaaa 


ccccatctct 


36420 


gtggtgggca 


cctgtagttc 


cagctactcg 


36480 


caggaggtgg 


agcttgcagt 


gagccgagat 


36540 


agtgagactc 


tgtctcaaaa 


aaaaaaaaaa 


36600 


aagtaaaatt 


gtcttgaaca 


gtattttata 


36660 


ttggatgttc 


ttgcaacgtt 


aaccactttt 


36720 


caatctccat 


tctgaatttt 


actacttttt 


36780 


ctcaattgct 


ttattctgaa 


aattaccact 


36840 


ttgatgtttt 


gggccattgg 


gtaagtgtgg 


36900 


actaaaatgc 


acaatttaat 


ctttataatt 


36960 


tcactcttgt 


tgcccaggct 


ggagtgcagt 


37020 


tctgctcatt 


gcaaactcca 


cctcctgggt 


37080 


gtagctggaa 


ttacaggcat 


gtgccaccac 


37140 


acggggtttc 


tccatgttgg 


ccagcctggt 


37200 


acctcggcct 


cctaaactgc 


tgggattaca 


37260 


tattattaaa 


aagcctactg 


gtcattttca 


37320 


tctatgaact 


ttcagccttt 


cgccttgctg 


37380 


tcatgtgtat 


qtqqaaactc 


aacactcttc 


37440 


tgtttacttc 


atttctagct 


aatcccacat 


37500 


taaccgcttg 


aggtgtcttt 


gcgaaactaa 


37560 


tatagtaagt 


aattttattt 


cttgccactc 


37620 
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actcaagata agaaaagaaa gtttcttttg 
agttagaaat tggcaaaagc aagattcaat 

5 

tatttgctat tgctatcccg aagaagaacc 
catagacttt aatggcacaa tgatagagac 
10 tgatttcgtg ggcatgggac ctgtgcagtt 

tgcttggtct aatgctctgc tgttgctatc 
acactctgta aattacgtag ctggttctgc 

15 

tgatgaattc ctattaactg tgtgaacatt 
cataagagct ttctggaatt tgaagcaatg 
20 cagtccacag gtctatgagc ccatagtgta 

acataagaaa gagaacagtt ccctaccaca 
atagaaatgc tggaaagcat ataattatta 

25 

aaagtagtta gagtctacca cccggaatca 
tcaactatac ttaacctaga aagacagcag 
30 taattttaat aagagagtct ataaattaca 

ttcccagagt ccatgatgaa ttcctgttaa 
agcaggggac ctccataagg gcttcctggg 

35 

gcacctgcac aatcagtcca caggtctata 
gtatttgttt ttgtaaataa agttttattg 
40 gtgtattgct gcttttgtgt tataataaca 

gtctgcaaaa cctaaaatgt acattatctg 
ctacacttct agatggtaaa atccttgaag 

45 

ctatattctg tgcctacaac aatgtttgaa 
caagaaaata aaaaagcaat gaatgcaaat 
50 tcatatcctg agctcttggt ctattggctc 

ggagagaaaa agagagaata tttaaaccag 
aatagaaata agtgtactgc ccaattgttc 

55 

gactagccct aaagccagtg ctctggggag 
attgttgtat tagcttctta ttgttactgt 
60 catataaaat catttagtgt ctcaagacaa 
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agaatttgac 


aaattttccc 


aatgtcactc 


37680 


tgtagaactt 


cttgtctcta 


atttccctct 


37740 


ttacttttag 


tttatgaggc 


tctcttcatt 


37800 


aaaatctgga 


attggccagg 


ttcatagggc 


37860 


actcaggacc 


tcagagttag 


aagagccctg 


37920 


ttgaaattat 


ttttaaaaat 


tttaaacaag 


37980 


attcatctca 


atcacctttc 


ccagaggtca 


38040 


tttatgcctg 


ttctaatagc 


agaggacttc 


38100 


agtctcttct 


tgcctgggca 


cctttacatt 


38160 


tgccccatca 


cagggttggc 


accccggtag 


38220 


aaggaacttt 


ggttctcctg 


gaagaaaaag 


38280 


aagaggccag 


aaaaagttga 


gctctagaaa 


38340 


ttaggcagaa 


ccatatgaaa 


ttgccaatga 


38400 


tttcgtatag 


ctcaagctaa 


gatatatatg 


38460 


tagctggtcc 


tgcattcatt 


tcagtcacct 


38520 


ttgtgtaaac 


atttttatgc 


ctgtttttat 


38580 


atttgaagca 


acgggtctat 


tcttgcctgg 


38640 


tacatgctat 


gtttttggaa 


gtccagccct 


38700 


aaatgtaacg 


aacccattta 


tttacatatt 


38760 


ggtttgaatg 


tgtgtgccag 


agaccttacg 


38820 


gcctttacgg 


aaagcttgcc 


atcttctagt 


38880 


acaggggaat 


tgtctgtttt 


gtttgctgtt 


38940 


acacagaggc 


cgctatagct 


gttgaaggca 


39000 


attctcaagg 


tctccttaag 


tctgcctcac 


39060 


ttccgcaaca 


gagggactat 


ccctaaccat 


39120 


ataatcgctg 


catcagtccg 


gggtaaaatt 


39180 


tccaaatttt 


gactatgttt 


cttggctgaa 


39240 


atggattaaa 


gactgaccta 


cttgatgttc 


39300 


aacaaattac 


tacaaatcat 


taggcagaac 


39360 


tacagtttgg 


ttatcttctg 


tttctggagg 


39420 
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tcagaagttc 


aaagtgggtc 


tcactgggct 


aaaatcagtg 


tgggcaaggc 


cgtactcctt 


tctgaaggct 


taggcaacaa 


taagttttac 


tgccttttct 


ggcttctgta 


ggctgcctac 


ctttcttggt 


ttgtcatccc 


atcctctacc 


ttcaaatcaa 


gcaatgcagc 


atcttcaaaa 


ctctcttgaa 


ctctattctg 


cctcttgctt 


ccacttataa 


ggattctgtc 


attactttga 


acccacctga 


ataatccagg 


aaaattctct 


cattccagct 


aataagcaac 


cttgatttca 


gctgcaaatt 


taatcctttt 


tgttgtgcaa 


aaagaacaat 


ggtcccccca 


gggaggtctg 


cattcaaatc 


cccagaagct 


gaggattagg 


ggactaggat 


gcagacctcc 


ctgggggacc 


attactctgc 


ttacaatcac 


gaagatatgg 


aatcatagta 


ttttctcaat 


gatatgagtt 


tctagtactc 


aataggatat 


gatgacaatt 


atctttatac 


taccttccta 


acattttaaa 


gcactttcat 


cttccatgcc 


cacaatagca 


ctgggaggtg 


ctgaactagg 


gaataaatgt 


ctgtgattat 


tttattagtg 


aaaaaagtga 


gaagctactt 


cagcagatca 


agaattggtc 


aagggtatgg 


ctgccattgg 


gaggcaggtt 


tcctgactcc 


cagccagtgc 


tttttctctt 


atgacattgt 


ggttgctggg 


aaatgttttc 


acttgtcatc 


tccatttctc 


acattccctt 


gttatgagtt 


ttctcctagg 


gtacggatgg 


catggtgctc 


ctcttaagtg 


tgctggaatt 


ctgcattgct 


gtgtccctct 


ctgcctttgg 


atgtaaagtg 


ctttgttgta 


gccccagtga 


ggtcagtatt 


ggccttcatt 


tgaaggtatc 


tgtattactt 


ttgtattgct 


gtataatcag 


ttaaacaaac 


ttatagtctt 


aacacagtac 


acattcatta 


tatttcagtt 


tccatggatc 


aagagtctgg 


gcccagtgga 


tgtgggtcct 


tggttcagga 


tcttatgact 


ctgtcatcag 


ggagttggcc 


ggggctgtgg 


ccttgtctaa 


agcttctgat 


tctctttcat 


gtgaatgtgg 


cggaattaat 


ttcctttaag 


ttgaggactc 


atgacaatgt 


actttttctt 


tgagaccagc 


aggagagcca 


gagtctctag 


tgctttaatc 


ctctcataaa 


gcattcacat 


gaagagatga 


ggtccatcca 


gaataatgag 


tttttttata 


aactgaaagt 


caacaaatca 


gtcctacata 


cgggacttgt 


atcccatctt 


atacacaggt 


ttcatccaca 


cttgagagaa 


gggaatcaca 


aggggtcagt 


gagtgtcacc 


ttagaactct 


gcctagcaca 


agagctaaat 


aaaagctctt 


tttccccttc 


ctttctcaaa 


gggcataacc 


tctgaatcca 


gataagaaaa 


tatttcacaa 


tgaagaggtc 


cccaaaagta 


cttggtttta 


ttctattatc 


ctattacaac 


aacttttcct 


ggaatacgga 


aggaatgtca 


tgatctcaga 


cttctctatc 


agtctgtgcc 


cacctgccca 
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taaaccaaaa 


aaaaattaca 


cagagagagg 


aaagtgagga 


atggaaggag 


agaagaaatc 


aagaaagagg 


ctctaagaat 


aatggggtgg 


gatatcctcc 


tagaattgag 


ggttccagaa 


tgaggaaatg 


gatagtaaaa 


ccccaacagc 


tactcattag 


cttcattacc 


caaagtctga 


aaggcagaca 


aaaccgtcag 



39480 
39540 
39600 
39660 
39720 
39780 
39840 
39900 
39960 
40020 
40080 
40140 
40200 
40260 
40320 
40380 
40440 
40500 
40560 
40620 
40680 
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ccaaagaata aagtaattgt agtttgagga 
tttgtgcctc tctgtcttct ttagggattc 

5 

accctccctc atagaagaga gattgactat 
attgtaaaac cctagaggtg agagattctc 
10 aagtgaagta actttcatgt catgatattt 

gaagctgggg aatgttgttt ccctaatccc 
ggggcatatc ttggctaaat ctctatctct 

15 

aatgttgcac tgagtttgtg gaagaataac 
tgctattcca ggccagtgtg gtttttgcaa 
20 tgggtatgtt gatgtcatca gcatgaacaa 

catgatacct tggtttaccc atcatattgt 
cttgcgagac cagggcagtg gtcatcagca 

25 

tgtcctaaat ctagcctgat ctaaagaatg 
aatgaatcag agaagaaaag agtaaactct 
30 gggcactctg aggtgatttg ggtgactcac 

caccatcaaa ttctcacatg gcagaaatag 
gccaccaaaa gattaacaga agaatgctcc 

35 

ccacatgaga aagtaccaga atccaactcc 
atatggaatc caattatgac ctctgtgtgt 
40 atatatatat acatatatat atatgtgtgt 

tttttcccct ggaactcaac aactaatttc 
aaattaataa ataatgcatt taatgaggca 

45 

aggactttat ataaaggcat taaactggca 
tattgctaaa acgaggtctc catgcaaaca 
50 ctcacttact tgactttttt tctgccatat 

agtggagatt catgcccttc tcccccctcc 
agtaccttgg aatagtatag tcagtccttg 

55 

tatacatgga gtaaaaatca tatgaaggat 
tcttcctctc ccttccccca ccttctgctg 
60 cctgtgtcac atatggaaat gtgcaacata 
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gattgtttct 


ctggtgccct 


tctccatcat 


41340 


ccaggcaata 


ccatatccca 


ctccccgaca 


41400 


agcgcattaa 


ctctagtggt 


ggggttttgc 


41460 


agcaattacc 


aatgattttg 


taatttcatc 


41520 


aagtctgtgt 


tccctttggc 


tctttgcata 


41580 


ctggtttgag 


tccagaaaag 


cagtggcttt 


41640 


tcctggtccc 


ccaccgtgcc 


ctaccaatcg 


41700 


cagaactttt 


cttagttttg 


tgatgatacc 


41760 


tgctttattt 


gtaaccagca 


aatcatgatt 


41820 


cctggtttgg 


aagaagagcc 


ccatattttt 


41880 


gcactgtcat 


ccacatgtaa 


gattgatcct 


41940 


gctcaggtca 


agagctatca 


ctgaatgttg 


42000 


actgagaatc 


cccacatgac 


tgatattgag 


42060 


gataatgatg 


actttatgtg 


ggcaggaaag 


42120 


ctatggcatt 


ctgattgttt 


gtgctaatta 


42180 


catctcccac 


accacttaag 


acagtttgat 


42240 


agaaatctat 


gctgactgta 


acacaagaac 


42300 


aatactgata 


gacatattga 


tatcattatt 


42360 


gtgtgtgtgt 


gtatatatat 


atatacatat 


42420 


gtgtgtatat 


attcaaaatt 


ttgttctcat 


42480 


attggccctt 


tatcgagagt 


actagaagtt 


42540 


gcagcacttg 


aaaggttttc 


attcatcatt 


42600 


aataagattt 


ggaagcagaa 


gggcaaaagg 


42660 


catacttctg 


ctcccctgta 


taacattcct 


42720 


ttggggacca 


aagtgctttt 


tccttcatga 


42780 


ttttccttct 


gctttccttc 


acccatagaa 


42840 


catgtgcaca 


agctatcatt 


tcagtaaagg 


42900 


cagattcaac 


ttatattttc 


tattttttct 


42960 


ggcagaatta 


tatcttaatc 


aaatgtgtat 


43020 


tggtatttgt 


taatgtttgt 


taattacatt 


43080 
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tgctttttta ttgcagagca aaaataaaat 
ttctattcag atgcaatatg atggaaagag 
5 gtggtaaata atgtggactt tctaatcaga 

ctttgttgtt tactacctat gtagctatga 
tttctcttca gcaaaatagt aataatattg 

10 

gtataataaa gatagagcaa ttagtactct 
aatacatata catatgtaca tatatacata 
15 tcagcatata tttgtaaagt atctacaatg 

atagcggtga ataagagaga taatatcctt 
gacacagatt aacataaaac aatcaaaaca 

20 

gaattaatgt gctggagagt ggttgttggt 
taaccaggaa agttcttttt gaggaagcga 
25 gaatttgcat ggacattcat attttccaac 

cactcacgta tctattgtat ccacctttct 
cctccttctt tcaatgctaa cctgtctact 

30 

gaaaaagtct attatgacat tctatcttct 
aaccaccaaa tagaacctta atactgttgc 
35 actattgcat agatgttcta tgtctcttat 

tttattactg gcatacgtgt gtgtctgtaa 
gcagaaatag caataaatgg aaatgaaatg 

40 

tgtcacaggg ttcatatagt agcatgatgg 
agcttcaagc ccaacaggaa agaggaaaga 
45 aaaatgatga gagtaatttg attggaaatg 

aggtcacaaa gctaccacag aacttcactc 
gagcagaact gagaagcgcc aagtgaagga 

50 

aagtcaaagt ttacctcaag ttgggccaat 
gaagatgatt aactgctata actgggcaaa 
55 ataaaagcca gaactcatac taaatcctag 

gtcttggatt ttgtaaaata tcctaactta 
aagtggcctg agaatgcctt ctgttggttt 

60 

taacacttac ataagcaacc taagattagt 
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tagaagcaat 


actttcatgt 


gctacctcct 


43140 


actaaatgtt 


gatgtcgaca 


gtactgtatg 


43200 


caaccctggg 


ttttcatctt 


gctatcccat 


43260 


acatgccaca 


aaatttatct 


gtgcttcata 


43320 


tctggcccat 


agacctatga 


taatttcttg 


43380 


ggtacatatg 


gagcctcaaa 


agctttaatt 


43440 


ttatttatat 


ttgtcagttc 


atcattttat 


43500 


gggcagacat 


tgttttaagc 


atataagaaa 


43560 


cctcttgtag 


gttaaacttt 


tatgggacag 


43620 


tttttatatg 


ctgttagaag 


ctaactgtaa 


43680 


agtggttgtg 


aaatatgagt 


cattagagag 


43740 


tattttcatt 


atgataagat 


aaaaagtatt 


43800 


tcagtatata 


ggatttctca 


ttctgcccta 


43860 


ccatgcacgc 


ttttcatggg 


aatgtgaggc 


43920 


ttctccagct 


ccggagatgg 


acagtatctg 


43980 


tggctgagga 


ttgaaaggta 


atctagcaag 


44040 


ttactattat 


gtagtatata 


gaaccttaat 


44100 


gggacatcag 


tgagaataca 


tgtagcctac 


44160 


aggaagtatg 


tttaaaagaa 


agtcaaataa 


44220 


aatgcttgat 


ttcatggatg 


agcatctgtc 


44280 


ctgctgattg 


cagcaaactt 


acctcttcac 


44340 


gagggagatc 


atttcttaga 


cagtgaagtg 


44400 


tgtcagtcat 


tcacacaatt 


aatttttcct 


44460 


tagttgagtt 


gatgtaattt 


gctcacagac 


44520 


tcagacccta 


gcaaaatcct 


ttgcatctcc 


44580 


ccctgacgtt 


ttaaatatat 


aaaggcaagg 


44640 


atccttcaat 


ggcttcctct 


ctaactcata 


44700 


aaatccttac 


atctcctoat 


tctttattct 


44760 


tatcctgata 


tattgccttg 


agattatttt 


44820 


aaattagcgc 


atacataaac 


tttttaaggt 


44880 


acatttttag 


ttgtaaaaag 


ctaatgcaaa 


44940 
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tattttaggc aattcatgca aaatattatg 
tatttcatat catgcaattt taaatttatt 

5 

aagtgattag gatgtaaagc agtaggaaaa 
catttacaag aaaaatagtt attcttttta 
10 ggctctgtca cccaggctgg attgcagggg 

ccttggctta agccatcctc gcacctcagc 
ccaccacacc tggcttattt ttgtattttt 

15 

ggatggtgtt gaactcatga gctcaggcga 
gagtacaggt ttgagccact gcacctggcc 
20 attccccaca acttactagc tgtgggtagc 

tataaaaatt acaatagtac ctaccttata 
aacttgtaat cacagaatga agattgacac 

25 

atactgatag gtgacatata aagaattatt 
aataaaataa taaaataagc taaagattaa 
30 tattatactt ttaaatattc aggaaaagaa 

tttaaataac tgtactaagt ataagagtaa 
accctgaaaa tcggtaattt tcattaactt 

35 

tattactgtt tcaaatttaa taccagaata 
tccctcaaac ctattttcca gtcattgtga 
40 cagttttcca ctgtcaatag taatacgcaa 

agagaggagt atttaacttc tacagatgtc 
ctggagtcag gtggaaagga tgatcctcct 

45 

taaatcaatt ttgtgtttag gaaaaactca 
tttggaaact aggactgttt tcggtttggg 
50 tttttaggac agaagaaaca tagtagggaa 

agtgtaacac aggtgcatgc attcagataa 
caagctccca tacattctat atacaaaaac 

55 

gtcattaatc ttcattgtaa acacattcat 
atagcaacaa tttgccaaca gcacacacca 
60 tcctcaccta caccaattcc tttcctttag 
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atttcaactt 


gtagagaagg 


ctgcctcatt 


45000 


ctttatttgg 


ttattctagg 


ttctatctat 


45060 


cgtccatgaa 


aactttgcct 


tcttttgcta 


45120 


tttatttatc 


taatttttga 


gacagggtct 


45180 


tatgatcttg 


gctcaatgca 


acctccacct 


45240 


ctcccaagta 


gctgggctta 


caggcataca 


45300 


tgtagagaca 


aggtttcgcc 


atgttgctca 


45360 


tccacccact 


ttggcctccc 


aaagtgctgg 


45420 


aagataatta 


ttctttacat 


acatgattaa 


45480 


ttatcattgt 


gtttcttcag 


tttcctcatc 


45540 


ggctcttaca 


aagactacag 


gtattattaa 


45600 


tggtaggcga 


tatgaaaaga 


atggtgagtg 


45660 


aaacaatata 


aaagcaaata 


acaaagtaaa 


45720 


cataagtctt 


aatttgagca 


cataagcagt 


45780 


acagataagt 


gtgtccacag 


aattctgatt 


45840 


tgtccatatg 


ggacacaaag 


aagctgcact 


45900 


cagtgaagtt 


ttagaagtgg 


gaatttataa 


45960 


atctaagatt 


ctctaaaaat 


ggagtcatct 


46020 


gttaggtaaa 


atgagattcc 


aactgtagta 


46080 


ttgggataaa 


gtagtggatc 


tttttctcta 


46140 


atattgagaa 


tgacatggca 


gaacaggcac 


46200 


acaagtaatc 


agctgggcat 


gcatatgata 


46260 


gtgtgggtca 


cttagtggct 


ttgatgtata 


46320 


taatagctct 


cagttggcta 


tctgagagga 


46380 


caagaaaaag 


actataatct 


aagccaatgt 


46440 


ttttagaatg 


ctactgacca 


gattgaatgg 


46500 


atcagaaagt 


tatttaaaaa 


tgtgtactct 


46560 


agcaccaaag 


gataggtttg 


ttgtggaaat 


46620 


taaatcttct 


tcaaatcttc 


aagagatagg 


46680 


atcatctcaa 


cttgtattaa 


taagacgtag 


46740 



-172- 



WO 02/062946 

aggcaatttg attctctgtc tgcttagtga 
taggccaaga attgcctttg tattggctgg 
5 caattagtga cagtggttaa ccatcaattt 

cataaggtct tgagtatata aggtgatatt 
cctattcagg tagagtcagt atacttggga 

10 

ttatcgagtg ctcaatggtg cccaggctgg 
acctccacct cccagccgcc tgccttggcc 
15 cggccgccac cccgtctggg aagtgaggag 

cgtgaggagc ccctctgcct ggctgcccag 
tgccatccca tctaggaagt ggggagcgcc 

20 

gaggagtgtc tccgcccggc cgcccatctt 
gccctgtctg ggatgtgagg agcacctctg 
25 agcgtctctg cccagccgcc ccatctgaga 

ccgtctgaga agtgaggagc ccctccgccc 
gtctccgcca ggcagccacc ccgtccggga 

30 

gctgccccat ccgggaggga ggtggggggg 
ccgggaggtg aggggcgcct ctgcccagcc 
35 cccggccagc cgctccgtcc gggagggagg 

ccccgtccgg gagggaggtg gggggttcag 
agggaggtgg gggggtcagc cctctgcccg 

40 

gggttcagcc ccccacccgg ccagccgccc 
cccgcccggc cagccgcccc gtccgggagg 
45 tgggaagtga ggagcccctc tgcccggcca 

tcattgagaa cgggccatga ggacaatggc 
ggggaaaaga ttgagaaatc ggatggttgc 

50 

agacttttca ttttgttctg tactaagaaa 
gtgaccttac ccccaaccct gtgctctctg 
55 tggattaagg gcggtgcaag gtgtgctttg 

ttaagagtca tcaccactcc ctaatctcaa 
cgcagggtcc tctgcctagg aaaaccagag 

60 

tccctccact attgtcctat gaccctgcca 
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ttgcaagtgg 


ctgctctgct 


gggttgaagt 


46800 


tatgtacatg 


tctgagagca 


agggctctgg 


46860 


gctctgtgaa 


taataaggat 


aaaaacaagt 


46920 


actgataact 


tctaactata 


agtagtccaa 


46980 


ctaaaaatga 


tttatttttt 


aaaaaagtat 


47040 


agtgcagtgg 


cgtgatctcg 


gctccctaca 


47100 


tcccaaagtg 


ccgagagtgc 


agcctctgcc 


47160 


cgtctctgcc 


tggccgccta 


tcgtctggga 


47220 


tctggaaagt 


gaggagcgtc 


tctgcccggc 


47280 


tcttcccggc 


caccatccca 


tctgggaggt 


47340 


ctgagatgtg 


gggagcgcct 


ttgccccacc 


47400 


cccggccgcg 


accccgtctg 


ggaggtgagg 


47460 


agggaggaga 


ccctccgcct 


ggcaaccgcc 


47520 


ggcagccacc 


ccgtctggga 


agtgaggagc 


47580 


gggaggtggg 


ggtcagcccc 


cgccaggcca 


47640 


gtcagccccc 


cgcccggcca 


gccgccccgt 


47700 


gcccctactg 


ggaagtgagg 


agcccctcta 


47760 


tgggggggtc 


agaccccgcc 


cggccagccg 


47820 


ccccccaccc 


ggccagccac 


cacgtctggg 


47880 


gccagccgcc 


ccggccggga 


gggaggtggg 


47940 


cgtccgggag 


ggaggtggag 


gggtcagccc 


48000 


tgaggggtgc 


ctctgcccag 


ccgcccctac 


48060 


ccaccccatc 


tgggaagtgt 


acccaatagc 


48120 


ggttttgtgg 


aatagaaagg 


ggggaaaggt 


48180 


cttgtctgtg 


tggaaagaag 


tagacatggg 


48240 


aattcttctg 


ccttgggatc 


ctgttgatct 


48300 


aaacatgtgc 


tgtgtccact 


cagggttaaa 


48360 


tt aaacacrat 


qcttqaaacrc 


acrcatac tCQ 


48420 


gtacccaggg 


acacaaacac 


ctcggaaggc 


48480 


acctttgttc 


acttgtttat 


ctgctgacct 


48540 


aatccccctc 


tgtgagaaac 


acccaagaat 


48600 



-173- 



WO 02/062946 



PCT/US01/48437 



ggtcaataaa 
ttttttcctt 
catatatttc 
caaatagtat 
tcagtcaatg 
gaattgctgc 
tatatttata 
aatcattgtt 
ctgaatacta 
gtcattctta 
catctcgggt 
gttttcaaaa 
gtaaacttgc 
gaacatgaat 
aataattgat 
tcttgttatt 
ttatttttct 
ttagaattag 
ggtattgaat 
aatccattaa 
ggttttttca 
ctggtactaa 
taagttttga 
gtactctaat 
caagtatcct 
tttcttactc 
catccagctt 
ggaaattcca 
ttacatgtta 
ttcctactct 



aaattaaata 
agacttactg 
tgtccccttt 
aagaaacgta 
ataaagaaat 
aaaaactgct 
aaatcaaatt 
acaacagtta 
tatgcaaagt 
cattacacag 
gttcacaatg 
tttgccattt 
aatactctgt 
taaaccagaa 
gcattttatc 
aatttttatc 
tcattattaa 
catgtcaaca 
ttataggtct 
cagaacatac 
ttttctgtta 
attttctcag 
ttaatatttt 
aatgtcttat 
tatctttgtg 
atttttagca 
gtggctagtt 
atgatttata 
gactacttca 
ttgcttcaga 



aataaataaa 
aagcaaaatg 
atcctctaaa 
atgatactaa 
gggtaatcaa 
gtctattgag 
aaatccactt 
taagaggtat 
caggagcaaa 
taaattggat 
accattaaaa 
tccacaactc 
ggaaaggatt 
aaaaaggaaa 
ttaatacttg 
attaaaatgt 
aattgatttg 
tatccaatcc 
ataagggaag 
tttttaattg 
acatgaggaa 
atctttttgt 
caaaggatgt 
gactgtcttc 
ataatgagtc 
acctaattaa 
aggctgtttg 
caaatatgtt 
tactgtccca 
ttatagagat 



taaataaata 
aagacttccc 
atctaatttc 
aagattaaag 
taatgagcaa 
aaaacatgca 
gtaaatgtaa 
ttctatattt 
tcaacagttc 
catagagcat 
atctgtagtt 
ttatttttct 
tcaagagaga 
tgagttgaag 
actggaaatt 
ataatagctt 
gccattctaa 
taaaagacct 
aattgctagc 
atttaggcct 
cgtcctttga 
ctgttgtctg 
ggatcccatg 
tggcttacat 
cttttccccc 
gttatgcttt 
gatctgtggg 
ttaaattcat 
ctgatgcttt 
tctatttcca 



aaaagtattt 
actaagctaa 
tcttcactca 
aatattcaga 
tttgtaagtg 
tccaactatt 
gagaagcctc 
gaggttgtaa 
taacagtcta 
gaaacagatt 
gaacttagtt 
tgattctgag 
tatactttta 
gccactgctg 
tcctgtttct 
tatagtagca 
gtctttattt 
tcctgaattt 
ttaacaatat 
tcattaatat 
cattcattgt 
aatacatctt 
ttgacagctt 
aatgtcttat 
tctgctgtta 
ggtgtagttt 
tttatacttt 
gctctcttct 
attaatgttt 
ccttccagtt 



atccagtatt 
atttgtgtat 
tcctatatat 
cgtgattttt 
tactataata 
aaaggtaatt 
aaagggcttc 
aataataata 
acagtgatgt 
caggaagaat 
taaattaact 
tcctgatcct 
tctttaataa 
tggctactga 
ttggtttgta 
catatctatg 
catgtaaatt 
tgattaggag 
taagtattct 
gtctcagcaa 
attgattttt 
tattttacct 
ttgccttcaa 
gagaagtctg 
aaacattctc 
gatttttctt 
tatcaaattt 
tggactcaga 
ccagactttt 
cactaacatt 



48660 
48720 
48780 
48840 
48900 
48960 
49020 
49080 
49140 
49200 
49260 
49320 
49380 
49440 
49500 
49560 
49620 
49680 
49740 
49800 
49860 
49920 
49980 
50040 
50100 
50160 
50220 
50280 
50340 
50400 
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ttcttctgga 


aaacttaatg 


tgttgttact 


cataggcagt 


gatttattca 


tttcatatac 


50460 


tgtagctttt 


atctctagaa 


gttctatttg 


attttttata 


tttttatact 


ctttgtgtcc 


50520 


aagtatttct 


tgaaattctt 


atgcacattt 


ataatgaata 


ttctaaagca 


cttctctttt 


50580 


attgtaatct 


ctataattta 


caggtctttt 


ttattaattt 


ttctcctggc 


tatttgttat 


50640 


tgtgattttt 


ctgtttcttg 


ccatgtatag 


taatttggga 


ttagatgctg 


gatattatga 


50700 


tttttatttg 


attgagtacc 


caaatttttc 


ttttttcaca 


agatattcca 


ttttatttct 


50760 


tgcaggaaat 


taaacttaaa 


gaccaatttg 


attcttttga 


gacttttaca 


aaggcatgat 


50820 


cgggagggtg 


tattagtccc 


ttttcacatt 


gctataaaga 


actacctgac 


actgggtagt 


50880 


ttataaagaa 


aagaggttta 


attgactcac 


agtttcacag 


gatctacaag 


aggcatggct 


50940 


ggggaggcct 


caggaaactt 


acaatcatgg 


cagaaggtga 


^ggggaagcc 


agcatgtctt 


51000 


atgcagtgga 


agcaggagga 


ggagagagtg 


aagggggaag 


tgctacacac 


tttcaaacaa 


51060 


ccagatctca 


tgagaactca 


ctattatgtg 


aacagcaaaa 


ggagagtcca 
• 


ctcccatgat 


51120 


ccgatcacct 


cccaccaggt 


ccctccccca 


tcattaggga 


ttacaattca 


acatgagact 


51180 


tgagtgggga 


cacagagcca 


aacagtatga 


gaaggtttat 


cattgttatt 


aattgggggt 


51240 


atgctttact 


aataatttgt 


ttattattaa 


cagattaatt 


attaaagcat 


taataataat 


51300 


tgacttatta 


ttaataacct 


ttctggaggt 


ctctattcta 


tcccccaagt 


attcatggaa 


51360 


gtctctgcat 


tccagctgat 


tggaacttga 


atgtctccca 


gcattgtgtg 


agccctggta 


51420 


atcatacagc 


ttacaactac 


ctggctagtc 


cttgactgta 


cttttaaaat 


ttcaacctat 


51480 


aaaggtacag 


ctttgtattg 


gacaacaaat 


ttaggatact 


tgcagatttc 


tggaactctg 


51540 


tctctggaga 


gctatcccct 


ttattgttct 


cttctcatta 


attccagctg 


tttcagtgcc 


51600 


ctcaaattct 


ggtttttgtc 


ttatgaattc 


agcaatatca 


gaataaatct 


ctctccctac 


51660 


catggtatgg 


aaagtttctt 


catgcagaaa 


accagagcaa 


ttattatatt 


cactttgttt 


51720 


tttcacttgt 


caatgttgaa 


attcctgcac 


taactttgtc 


caatgtccgg 


aaagagttgt 


51780 


gcttattttt 


ttccagtttt 


ctttatgatt 


taaaaaattt 


tattgagaca 


agatcatcca 


51840 


ggttcaagtg 


cagtggtaca 


gtcataggtc 


actgcagcct 


tgaaatcctg 


ggctcaaagg 


51900 


atcctcccac 


ttcagcctcc 


caaagtgctg 


ggattagaag 


catgagccac 


tgggcccaac 


51960 


ccagtttttt 


aactgtacag 


agtggaagat 


taagtctggt 


tctatcataa 


ttggaagagc 


52020 
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atatttaaaa 


tttacttaag 


cttgtcaaca 


tatgtaactt 


aattaaatca 


atacttctaa 


52140 


tttaaaatat 


ccaagcagaa 


aatttaagta 


ttatcatttt 


tatatgctgt 


tatttttagt 


52200 


caccttattt 


aagtttcatg 


aagtagctat 


taaaagaccc 


tagcacattc 


cagttatttt 


52260 
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atatttaagt 
aatatatgct 
aagttgtaca 
ggtttaaatt 
tttgattgcc 
aagctgacaa 
tactgactca 
tgtgaataac 
tacactcttt 
ttttcatcgt 
tgttttaatc 
tcatctctta 
agtcgccgtc 
tgttgattct 
agtaatttat 
atgaaaatat 
gtcaccaaat 
atatatacac 
aatatatata 
tattacacac 
aatacattta 
atgtttttaa 
actaatggag 
atccaaagaa 
agaagagttc 
aattaagaga 
agttggaaag 
aggaaacaaa 
aaggagagat 
aataagagat 



taatttgtga 
caaatgtctt 
cattccaggg 
ttgagtttac 
tggtggatag 
tgcttttgcc 
ggtttcttgg 
tctgtcaggc 
ctagcagaag 
caatttcagc 
cttctcagac 
aatggatgct 
tggatggggc 
ttttccctta 
tttttatctt 
ttccacctta 
tgtctaaata 
acacacacac 
tgcattttta 
aaatatgtta 
aatatataaa 
aaacataata 
aaaggaaaag 
taatatctca 
ctagtgtatg 
agcacagatg 
tgggtaacga 
agcacaaata 
ccttggaact 
attttatgga 



tattttccaa 
ttcttcacac 
aaaatattta 
tgtatgaaaa 
aaaaacaaaa 
ttcctctttg 
gattgctgct 
tgataatact 
aagcagttca 
aaagcgtggt 
tctaggtaca 
ttcagcttgg 
tcaggaatca 
tgccagcagc 
aatttttttc 
cctccagctc 
acatacatgc 
acacacacac 
agttacttga 
tttgaaaata 
taattgtata 
tatgcaaagg 
gagatgctgc 
ctagagattt 
gggtacgaga 
tccttaggga 
aaactgagat 
attgttttgg 
agaaaggcca 
taatgtgcac 



tgttttatat 
aacaaaaaat 
aatcttttaa 
acacaggaaa 
aacaactaaa 
aatctcttca 
gctcatgtga 
agattaagaa 
gaaagttcct 
attttccccc 
gtttcaccag 
ccaggagggg 
catagtgctg 
aaagtgcttc 
tacttagtat 
tgaagccaca 
atatatatat 
acacacacac 
aaatatattt 
catttaatta 
tataaatatt 
gagagataag 
ttactagctt 
tatatgtgct 
ggtatcatgg 
atagaagatt 
gcagatggga 
gtccaaagag 
gagagacaga 
agggagttgt 



agtaatatcc 
acaagaaata 
ttaattaaaa 
gatgtataaa 
tttgagatac 
agatcagcaa 
ttcccaggaa 
aacaattcat 
tgtggtttcc 
agtaaaaggt 
acacagtggg 
ctggacatca 
tattagctag 
aacccttgct 
tccctcatat 
gacactgtgc 
atatatatat 
acacacacag 
aaatacataa 
tataaataat 
tgtatattac 
tgaaggagat 
ctagcttttg 
tcagaaagat 
aatggaaggg 
ttctggtcct 
cctgggaaac 
ttaaggcctc 
cttgctcatg 
acgtatactt 



agacagtttt 
taataagaca 
tatgtccctt 
gcaacatcct 
agtttgaaca 
tttatttttt 
cttaactact 
cttcaataag 
cttctgaatc 
caagatagct 
taacttgctc 
agaggaggaa 
ctgtcatcta 
gattacttcc 
ttggaaaaag 
ccctaacagg 
atatatatat 
gcacacatgg 
ataatttgtg 
tgtatattta 
atataaaggt 
agaggaatat 
ttttccagca 
gtatagaaag 
ataacatgat 
gaaggagagg 
agggctagct 
cttattctct 
gacctgcagg 
agaggaactc 



52320 
52380 
52440 
52500 
52560 
52620 
52680 
52740 
52800 
52860 
52920 
52980 
53040 
53100 
53160 
53220 
53280 
53340 
53400 
53460 
53520 
53580 
53640 
53700 
53760 
53820 
53880 
53940 
54000 
54060 
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caaaatttct ctccaacccc ttgttgtcaa 
agaaacttgg gataactaga tgacctctca 
5 cttgagtgtc atcagcaaga cggtggaata 

gagactgact tgacaacaat gtatgaacta 
ttaagaggtt gcagcacctt aggtgaatgc 

10 

agaaattgtt gtcgcatttt actcattcca 
tgatctacag aaaacattaa agttccagtg 
15 gtaagcaaaa actcagtttt atgtaaaaaa 

gggctggagt cctttaaaat taactgccta 
tctgtctttc ccaggcatgg tgaagcattg 

20 

agacagataa tctcaagtca taaaacatgt 
gcatgtctta attaaataac tgtctttgtt 
25 cagatctccc cccaccccac aaaatgctta 

attcctttgg atgttaatct gactggcttg 
caacctctct gtctctctga ttacttaatt 

30 

ggcaagagga ctagaccatg agtccagcat 
tggtttctgt cttgcctgac tcagagtgca 
35 aacaaaggca ggcacactat ttcaccacta 

gggggagaaa gaagaagaag caggcaaaac 
ccaaagaagt ttgaagggcc cccagaatct 

40 

ctacataaag tcagtttgta atgactggaa 
ctcaataaaa gatcacaagg catacagaga 
45 acaataaatc tccaaaaact aactttaaag 

taaagatatt tgtacacata tttattgcag 
aacccaaatg tacattaatg gatgaataaa 

50 

ttaatcaacc ttacagaaga aggaaatcct 
agatctcatg ctaagtaaaa gaagccatca 
55 acaaatactc tgtgattcct ctcatatgag 

ctgtagacag aacccctcag acaccaagtt 
cttcagcaag actcacgtct ccaaaaacca 

60 

ttaagggctt acaactttaa gtgggtccat 
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acataagcat 


ctgtttttga 


gctctgacac 


54120 


gcacaatagg 


ctctagcaat 


aggaaattga 


54180 


ggaagcctca 


ggccctcatt 


tccccatgga 


54240 


caatgccttt 


gtgagaactc 


tagaaactag 


54300 


aaagccagga 


aaagatgcat 


caaaatgagg 


54360 


gccactcttt 


caccctggca 


caatgcgact 


54420 


gaacaggaat 


taaaagaaat 


taaaaactgt 


54480 


aaacccccaa 


tttcccttaa 


agaagagaaa 


54540 


tttttctttc 


tgaggctagt 


gagccttatc 


54600 


tttttctagc 


tgtgcagctg 


caagatcact 


54660 


tgtttcttga 


aaaggaagaa 


atgatgtaat 


54720 


tctcacttct 


gtaatatgct 


tcctcctgca 


54780 


aaaggtaacc 


acattctttg 


ttcagggctc 


54840 


gtgcacctaa 


aaaattaaat 


aattcctccc 


54900 


atcccactgc 


accagattct 


cctttaaaga 


54960 


tctgggtttt 


gtgggggtta 


cccaaggtac 


55020 


gagttgaata 


gcatgttgtg 


agcctcggag 


55080 


gggagattgt 


agtactgcag 


gcagacacca 


55140 


tcttttacta 


ggaaattaca 


catacatgtc 


55200 


ctagctatgc 


taacccgtaa 


aggtcttcca 


55260 


aaggttggtg 


ttgttttcaa 


atgcacaaat 


55320 


agcaagaaaa 


catggcccaa 


ctaaaggaat 


55380 


aaataagtat 


ctatgagtta 


catgatcttg 


55440 


cattattaat 


aatagccaag 


aggtggaagc 


55500 


gaaaatgtgg 


tatatacata 


caatgaaaca 


55560 


atcatatgct 


acaacaagaa 


tgagacttgg 


55620 


caaaaaaaaa 


aaagaagcca 


tcagaaaaag 


55680 
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5574 0 


aaagaaggaa 


gggctttatt 


tggccgggag 


55800 


agttcccaga 


gtgagcaatt 


cctgtccctt 


55860 


gtgagagggt 


cgtgatcaat 


tgagcaagca 


55920 
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ggggttatgt gactgggggc tgcatgcacc 
tcacagtgct tttccataca atgtctggga 

5 

ggtcgatctt taaccaggcc cagggtgtgg 
ccgcctttta gtttttactt cttctttctt 
10 aggggtggtc tcctccttta gtcccccgct 

cactttcatt ttcactaccc atgtcttctt 
acatttgtgc tgaagcattt tggtgaacta 

15 

gaagtacagg tagcaaacaa gagagcagta 
ttataagagt tttaaatcct cctagcactg 
20 catatccatg ccacacttgc acgggcacat 

cttcaactac ttgcccttgg tcatctatgt 
tacagacccc tccttcagct gccagcaagt 

25 

catttctcat ctgagtttct tgctgggcca 
tgattatttc taagacagct tgtaaccata 
30 agtatcccca tgagctgtct tttgcccaag 

caggggacca tttattatct ttccaatttc 
catagacagg gaagcccagg agttcacctg 

35 

taatagtgcc aataacacaa ctactttccc 
gcccacatat ccagtataat ccagtggggg 
40 ttcacatggt ttacaacttt gggaatttac 

gccacggtgg ctgtttttgt agtactatta 
cccacaggaa gggtgaagtc cttccccact 

45 

gctttcaaga cccagaagtt atcagggtga 
gggtctgtag gtactaattc tcaggcttcc 
50 ccgcataaat aacatgaagt gacattgaga 

aacaaatttc ttgtttttcc tggaatttct 
tttgaaatac tggctcagga gagggtttat 

55 

aaggatccag tccagcccca tcgatttcta 
gatcaagggg gttggttatt accagttcta 
60 ggccactttt ccttttctga aggtggacag 
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agtaatcaga 


atggaacagg 


acagggatat 


55980 


tccatagata 


acataaccag 


ttagctcagg 


56040 


tgctgggctg 


tctgcctgtg 


gatttcattt 


56100 


tggaggcaga 


aatgggcata 


agacaatatg 


56160 


ttgagaatct 


cactcattag 


tgggagttct 


56220 


gcaagacaga 


tcaatagtga 


ttcatatagt 


56280 


aggtagcgat 


gaagcttttt 


atcatttgaa 


56340 


agcaggtttc 


tattactatt 


attactccta 


56400 


ggaaccattt 


tccaaacatg 


gccccaggat 


56460 


gtgccagttt 


tgtcatattt 


ctaactatgt 


56520 


gtagatagca 


attagtaagg 


ttaaatgtcc 


56580 


agttgggagc 


caatctattt 


tgatagatag 


56640 


gaatagtcaa 


ggctctgctg 


gtcttattag 


56700 


tgctttggtt 


gagcatgtaa 


atgggggtcc 


56760 


tagcaggcca 


ataatattgt 


atgattcttt 


56820 


ttatggctat 


gtttctcttt 


tcatgggaag 


56880 


tctttatggg 


cagtaggaag 


aaagatgatt 


56940 


actggtcagg 


taatttggca 


taagttctat 


57000 


ctgtccagtc 


ctggtggtac 


tctgggtggg 


57060 


taaatggatt 


tttcttagtg 


tggtttgaac 


57120 


tacagttttt 


gcccaaggca 


gctgggtctt 


57180 


cttgctatac 


agtattgtct 


aatgattgag 


57240 


ttctcttgag 


ccaggaattt 


atcaggaact 


57300 


catggccatt 


gatctcccag 


tatagttcct 


57360 


gactgggcta 


catgcttggc 


taattgcaaa 


57420 


ggcactggca 


cattcagttc 


atcatagaag 


57480 


aaacttcttc 


tcaaactatg 


atatttactc 


57540 


gggttacatg 


ctcccctttt 


ttccagggag 


57600 


aggggttaca 


cggacccctg 


gtataggaag 


57660 


gatttttttt 


ttcatttttt 


atccaagtag 


57720 
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cctaaatgac 


acaagaccag 


tatccacatt 


cattttcaca 


cagtcctaat 


tcatgacaaa 


57780 




tgtacttatt 


ttttgtcata 


tagcctcttt 


tttaattaag 


agaaccatat 


cctatttcta 


57840 


5 


acttattact 


attaatgaca 


gcacaggcat 


caaatttcaa 


ggtgactttt 


ttgg'gcaccc 


57900 




ctttttcttc 


tgttttggct 


aacactttaa 


tcatgtcatt 


tatgagcccc 


cacgagtcct 


57960 


10 


cagtccttaa 


tcttatttca 


aaaactgtgg 


tcatgggaag 


ctcagatggg 


tcataacaca 


58020 


cataacacag 


gtcatttcct 


gggttacata 


ccttgtatag 


aataacatta 


tacaaacaag 


58080 




ttattttcag 


agttccagta 


cacttataat 


aaccataaaa 


taataggacc 


atagcaactt 


58140 


15 


tttgtcctac 


ctcagtgact 


tggtgtatac 


actgggaaca 


gtcctcagtc 


tgaggaaggt 


58200 




cagttgaagt 


ccttactgta 


caagttcaaa 


ttttaaggaa 


aatgagtccc 


atgatgagtt 


58260 


20 


ttcttatgct 


ttggccatgt 


gtggaccagt 


cagcttctgg 


atgtgactgg 


agcagggctt 


58320 




gttgtcctct 


tcagagtcac 


tttgcagggg 


ttggtgaaac 


tgctactgtc 


cacatacagc 


58380 




tcacagtcta 


ctgatgttca 


aggatggtct 


tggaggttgg 


gcctgctaga 


ataaactagt 


58440 


25 


ccaatacctc 


tacatattta 


tgttcagctg 


agctctctga 


taccgggaac 


aaggtggcag 


58500 




ggtttagggt 


gttgcaaact 


tcaatggtta 


tgggggattt 


tcacatagca 


agctttgata 


58560 


30 


cttggttaat 


ctagcatttg 


ttagccaatg 


atgtcctttg 


gtattcatca 


aagttaccac 


58620 




agcatggaag 


gcctttatat 


tcaggttttg 


cctaagcgtt 


agtttatctg 


cttcttgtgc 


58680 




taacagggct 


gtcactgcca 


gggcccctta 


gacatggggg 


ccagcctttg 


gaaactttgt 


58740 


35 


ctagttgttt 


tgagagatag 


gccactgtcc 


ttggccaggg 


cccgacagtc 


tgggctaaaa 


58800 




ctccaactgc 


cattttttct 


ctttctgaca 


catagagtgt 


aaagggtttt 


gtcaggtcag 


58860 


40 


gtagccccag 


ggctgaggcc 


aacatgagtt 


tttcttttaa 


ctcatgaaaa 


gttcattgct 


58920 




gttggttgta 


atagatgtag 


ttcatctaat 


ctacatattt 


atttacagtt 


atctactaaa 


58980 




atgttgactt 


taatcttgca 


gctatttgat 


ttcaagcttt 


aaagtgatct 


ggtattcccc 


59040 


45 


atgggaatcc 


aattgcatct 


aaatagacgt 


gagagtcaaa 


agacccataa 


ggggcttctc 


59100 




tcgctttaca 


atgtcttatt 


tttcctccct 


ctggctgatg 


aaatgccagg 


gtgaaaggga 


59160 


50 


tagccaattg 


gactgaagta 


caagtgccac 


tccagttatt 


cagcagagtg 


cccagtaaag 


59220 




gtccaccaca 


ataccaccac 


acatctgctc 


tgggatgaat 


aagggctgac 


tgattgataa 


59280 




gctcttgaaa 


attcttaagt 


tcactacatc 


ccttcaggtt 


tccaaggaac 


gctaagtttc 


59340 


55 


r , t _ t - r , r , 'l~3l~t"cr 


faaaaaarar 


aaaatoaact 


taatattaaa 


aaatcrocacf c 


t q q a t Q a c c c 


59400 




tcgggggctg 


aacttcaggg 


tgccagactt 


tgggatatag 


cagagagagc 


ttggcatgac 


59460 


60 


ttattactcc 


aggctgtaga 


atcctggaaa 


agagatacca 


tgcagcccac 


ggtccactgg 


59520 




aggaccacct 


tagtggaaag 


gggactgtct 


gggcctctgg 


cctgccacgt 


gcacaagcat 


59580 
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aacaattgct 
catcttggta 
tctagtaaaa 
ttgctcttta 
gggcaagact 
taatttacta 
gagagctgtc 
atagtttgag 
ggaaagaaga 
gtagggtttt 
ggtgtgatga 
agggtagggt 
aatgtgatct 
aagactttta 
aaattgacct 
ctttctactg 
caaacagctt 
cacccttata 
atttatttaa 
tttataatct 
ttaatgttca 
cacacagaat 
ttaaaaatct 
ccagaggtgt 
tacattcagg 
ttataactct 
gacttattac 
tactatataa 
gggataacca 
gctacaagat 



tttgtttaac 
tcttgtctta 
tgaatgtatg 
gtggtcaaca 
cctggttgac 
atgcccagtc 
tttgacttgg 
gcaaaattga 
aagagtaata 
cccctgggac 
gtccatcctt 
ccttcctagg 
tcagactggt 
gttttgaggg 
tttgttttaa 
agaaatttcc 
tttatatttg 
ttagtgtgtt 
tttttaatgt 
ttgttaaaga 
gttcacagaa 
ttcctttaca 
tttaatgtag 
attttacttt 
aggcctagtt 
ttttctttca 
aaacatttct 
gattcttttt 
ttcttttcca 
tagaagttac 



gtgcaaatgg 
actgccaaag 
attttaggaa 
gaacgttgga 
actggagtct 
tgaggagagt 
caagtcccca 
cttggttatg 
gaatagataa 
tatggcccac 
tttccactgt 
ctggctcgaa 
gctggcttac 
aaaggaaagt 
atgtggggat 
tttagcacct 
acgcttcctg 
attaatgtta 
tagatcataa 
gcaggttaat 
aaactggatg 
attgacgttt 
gtaaaaatcc 
tcttatacac 
acttttaaat 
tgacttttgt 
ttctttaaac 
acataaattc 
aagcaaactt 
tattatacat 



aatatttgat 
tttgttttaa 
attacaaaaa 
ccaactatgg 
ttaccaaagt 
caggagggac 
cagggtataa 
ttaataacta 
aagagttaaa 
gactctggag 
atgaacagca 
tttcctcttt 
cagaaattgt 
gggagataaa 
gtcagcagtg 
attttttatt 
tatgattttt 
aacttagttt 
ggtaagattt 
gctttaagag 
ataccccttt 
taaaacttgc 
acattcttat 
cttgcacata 
tatacaacat 
agacaattct 
aaccagttaa 
tgcccccgct 
cctttatgtc 
gttatactgt 



ccattttaac 
gtctttaact 
ccagttgggt 
cataaaagtt 
ttcccccaat 
agaggtactt 
taaggcaagc 
gatggtcagc 
tttttctcag 
cgggtggagc 
gtcttggtgg 
ttttttcact 
aggggtggca 
ccaagtatat 
gactttatag 
agtttttaga 
ataccagata 
taataaaact 
ttatagactc 
aaacccattg 
aactttagcc 
ttaaaccttc 
gcctccttat 
aactgttttc 
ttcttgcatc 
tcaacatgcc 
tttatttcag 
tttttccccc 
tgtggattag 
taacttttag 



caggcatttg 
tctatgatcc 
cagtccatcc 
ctacattggg 
caaatagtcc 
ttctgaagta 
attaaatgca 
aatagagtga 
ctttagtttg 
tttcttgact 
ttagcagcac 
ttttgatgag 
catgtgttaa 
aatttttaag 
tccttggtgc 
ccaaagaagc 
agctaaatgt 
tagtaggcat 
tttttaacct 
tgtttttatt 
aatatgttta 
aaaacaattt 
aatgctttta 
tcagtagttt 
aattcttttt 
tcaactttct 
gacaagaatt 
tttttttggg 
actgtctaag 
caaattttac 



59640 
59700 
59760 
59820 
59880 
59940 
60000 
60060 
60120 
60180 
60240 
60300 
60360 
60420 
60480 
60540 
60600 
60660 
60720 
60780 
60840 
60900 
60960 
61020 
61080 
61140 
61200 
61260 
61320 
61380 
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ttttgttgaa 


aaccttgtac 


gtttgggatt 


tcaattatcc 


tttgctatta 


gtaagacttt 


61440 




gtttagtcca 


aattaactta 


gaattggtat 


agacggcttt 


ttcttttcct 


tccattacct 


61500 


5 


gggaggaacc 


gtctatcatc 


cagtcctgaa 


gggagttcct 


cctaggtctg 


gtcagacctt 


61560 




tgtatggcaa 


ttaagattta 


gatcccctgt 


taggaaacct 


gctgggttaa 


gggaattttt 


61620 


10 


agtggttaat 


gttaaatcat 


cttttttttt 


ttcccttagg 


atacttctga 


actggtgagg 


61680 


tgtgctcaca 


atgaggtttc 


ctctaaaagt 


tatttttcta 


ctttcttctg 


ttagcaaagt 


61740 




agttgccgct 


acagagtgaa 


tgcatttgtg 


ccatccgcag 


attactaggt 


taaagatttt 


61800 


15 


tgataggaag 


tctacaggtt 


gtcagtggcc 


tcagtgcttt 


cgggctacgc 


acttgtttac 


61860 




acttacagca 


aggtggtatt 


ggagtgttgt 


agggtcacag 


agaagacctt 


caattatcaa 


61920 


20 


ttataggttt 


taaatttacc 


ctgtctttta 


aaggaatagg 


gtacactgtt 


tttttcttaa 


61980 




ctacttgtat 


atctctttct 


ttctgtcttt 


gactttctgt 


gtctttctct 


ttgactttcc 


62040 




ttttgcctct 


gtatctttct 


ctctctctct 


ctctgcctct 


ttctttctct 


ctctctctcc 


62100 


25 


ttgactccct 


ctttgtctct 


ctgtctcttc 


ctctctctct 


ctctgctggt 


ctttccttgc 


62160 




ctctgccagc 


tgcttatgct 


gctgttctct 


caaccactgt 


gagtgggtgg 


ggtctaaaac 


62220 


30 


cagctgtacc 


caagtgtcta 


tatacgggaa 


ctggtctggg 


tgccctggct 


tacaagttac 


62280 




cttgtgccac 


acctttgaaa 


caagggacct 


gtccaggctt 


ccttctgatg 


gccaacccac 


62340 




ctccaatact 


ggccagtcta 


tttcacacaa 


agttctagtt 


tttcctggtg 


tcatagtaac 


62400 


35 


actgtaatct 


cccttaaatc 


ctttcttgaa 


atttttcaac 


atagctccta 


gtggtcttac 


62460 




tttgtgcctg 


gcccatgctt 


cctgaagaca 


aaacaccatg 


ctcacactgc 


aaaacaaaaa 


62520 


40 


tgggtaaaaa 


gggcacacac 


acacttttgc 


agtttacacc 


aaaccagaat 


caaaaccaaa 


62580 




atcagagtgt 


caggaaatcc 


aagccgggtc 


aaaaccaaaa 


ccaaagtatc 


aagcaatcca 


62640 




agtcaagtca 


aaatcaaaaa 


ccaaagtgct 


ggtacaggca 


caccatgggt 


gatcaggcca 


62700 


45 


tgcttccact 


taaatggagt 


gggcaagttc 


caaagaccag 


tcttaccaag 


ttttagatgt 


62760 




ccggactcca 


agtgccagtt 


ccttcatggt 


gttcagccac 


tgcattgatc 


ctccacaggg 


62820 


50 


gcctgctgtg 


cactgctctg 


atgaggcatt 


ccactggggc 


aattgcctac 


ctgggagtgc 


62880 




tctcaggatc 


tgcttcactc 


aagctggctg 


gagtccccct 


caggtatgct 


ccacagggca 


62940 




ggcctaagcc 


acctaaggag 


ctgtctcgac 


catccattaa 


tcaccttgtt 


tcctggtcag 


63000 


55 


crcraaccaacfa 


aatotaqcaq 


qacaaaccac 


aqacaaaacc 


cctcagacac 


cgagttaaag 


63060 




aaggaagggc 


tttattcggc 


caggagcttt 


ggcaagactc 


acatctctaa 


aaaccaagct 


63120 


60 


ccccaagtga 


gcaattcctg 


tcccttttaa 


gggcctacaa 


ctctaagggg 


gtccccatga 


63180 




gagggtcttg 


atcaattgag 


caagcagggg 


gtatgtgact 


gggggctgca 


tgcaccagta 


63240 
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atcagaacag aacagaacag gaaagggatt 
atctatagat aacataaccg gttaggtcag 

5 

gcacagggct gtctgcctgt agatttcatt 
ttgggggcag aaactgtgca taagacaata 
10 gcaaacatgg aacagaaagt aggatggtgg 

ggtggttgtt caatgggtat agagtttcag 
gttacacaac aatgtgaaca aaattaatac 

15 

ataatgaatt atatgtgttt tgtaccacaa 
ttgatttaaa gaaaatattg ttctgtttct 
20 aaaccctttt gctcctaggt cagtagctct 

ctccatcagg agacatttga caatatcaga 
gattgtgctg gcataggcca gggagtctgc 

25 

ctactgtaaa gaattaccca gtccaaagtg 
cagatcatta atcagaacgt gttattgata 
30 tagaattcag actctggaaa aaactgggaa 

gacaagagaa aggagaattg ttttaagctt 
ggg a gg att 9 gggggtgagg aagacagtca 

35 

ggttctgtgc taggaacctt acacgtgttg 
ttttaaatta aatagcacag agaggaaaac 
40 gcaactttaa ctggtttatt agcacattcc 

gaagagacca aagagactgg gctccttttt 
actgtttcac agatgctgtg agggtgacaa 

45 

ggcactctgc gaaagaggaa ggggcgccag 
ctgcccctct ccccaggctt tctgtgtgag 
50 acttaagaca agccaattac atttcataat 

ccagccttct aaatggtgct tcttaactcc 
ttaagtgcag aatccatggc cctctgtggt 

55 

ttcattctgg aagctccccc aaggaaagga 
attgccctga ctggctctgg tttccttgcc 
60 ccacccaagg ttttgaaggg gcacagtgcc 
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ttcacagtgc 


ttttccatac 


aatgtctgga 


63300 


gggtcaatct 


ttaacgaggc 


ccaaggtgca 


63360 


tctgcctttt 


agtttttact 


tcttctttct 


63420 


tgaggggtgg 


tctcctccct 


tagagacaga 


63480 


ttactagggg 


ctaggaatag 


gaaggaaagg 


63540 


ttctgcaaga 


tgaaaaagtt 


ctagaggtct 


63600 


tagtggtctg 


tacactaaaa 


agtggttaaa 


63660 


tttaaaattt 


taaaaataaa 


agttgagaaa 


63720 


agcacacctg 


cctttacaca 


gaacacacaa 


63780 


caacatgaat 


gtgaggggat 


tttgttctcc 


63840 


tactgtttct 


ggttgttaca 


catagggtgg 


63900 


taagcatcct 


acagtgcaca 


ggatagccac 


63960 


gcattagtag 


cttttttgag 


aaacctgtcc 


64020 


taccagtctt 


ctcccttgta 


gttaggttgg 


64080 


tgtaaaaggc 


tgaatgagtg 


* 

taagctgaaa 


64140 


actaaagcca 


aaatggaaaa 


aaaaagaggg 


64200 


acagaccgct 


tttaggccat 


cattccgaag 


64260 


tctcacataa 


tccttaaaac 


tcagaaatga 


64320 


tgactggaga 


gggagccttt 


gcttaggtag 


64380 


cctgaaggga 


aagggggtgt 


gattgctcct 


64440 


aatcaaagct 


caggaggaga 


gctgcattcc 


64500 


agatgcaggg 


cacccactgg 


aaacacagac 


64560 


gagcttgggt 


gagcaaggtt 


ggaggtgatt 


64620 


tccattcctc 


ctctcagatt 


tattgttaga 


64680 


gtctgtgtca 


ttcagacgct 


gagaactaat 


64740 


tttcttcaac 


aggagagtta 


atgtgtggaa 


64800 


gacggtgatg 


gttcaggttg 


tgtttctggg 


64860 


ggaaggaagc 


ttgcagggtg 


gggctttgcc 


64920 


tgatacattg 


aagtcagctt 


ccaaagaatg 


64980 


tctgtcctca 


caaaaccagc 


ctgcatatga 


65040 
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gtttactgaa ttcatttagg tcctgctgaa 
tatttcaatg cttccctgga ggggcccaac 
5 gaaaaagagc agccagagga ggaagatgac 

attgaaaggt tgctttcagc ctgccaagac 
aacaaacaaa caaactatat atataaaaaa 

10 

attactatta gctatatatt aatgaatgtt 
tattattatt atggttagtt atagacagtg 
15 ctagcactat tactgttatt agttacatgt 

ttataaatat ttttgctcat ctttatgaca 
taacagatta gaaaactgaa gcttcaagaa 

20 

gtctcgaagt ggcaattcac acataaggct 
ttggctcttt tactgacctt gctagaagtt 
25 aaatctacaa tctgtgagta tcacatcctg 

ctgatgactt tcagaagatg aaggtaagta 
ttaaaacctc tttggagctt ctgaggactc 

30 

gctggaagac ttttctcatc tcactattcc 
aacttgaaac tccttgacag cagttaagag 
35 aaatggaagg agtatctgtt tttaactctg 

gtgagtcact tacctaaggt tgtgtaatta 
tgtttagacc atttgtaaac tgggcggtct 

40 

tccaagctga gcttagcaac aaataccaca 
gaggaccaac gacgcgagcc tatgtgttat 
45 tgcagctgtt tctccagcct gagtaaactc 

ttaggaagag gtagaatgat ctagagattt 
agatccaaga ctggttttac agcttcccgg 

50 

actgcaaata tttcttcagc attaactctg 
ttgcattcat aacttaaact ttattttgca 
55 ttactaacat gcagagctgt tgtaaggatt 

tgtctggtaa aatatgataa acatatctga 
ataagtaagc agagttcatt tattagtctg 

60 

gttgtcagta ggactagagg aaagggaggt 
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tctttcccat 


ttgttccttc 


tctgtttcaa 


65100 


ctttctggag 


ggcaaagatc 


tgttgttaga 


65160 


ttgagggtag 


gcagcagttg 


ggcaatcagt 


65220 


ttggggaatt 


agaaaagcaa 


gataaaacaa 


65280 


caatttttta 


aaagtttttt 


tatgttctat 


65340 


cctttaaagc 


agtgtagtca 


ttattgctat 


65400 


gaatattgtt 


ctttctctat 


attatgatta 


65460 


tattgaaagc 


ttcaaagcag 


cataggcttt 


65520 


attctccagt 


gttggtattg 


ctcctctatt 


65580 


cagacttgcc 


taacaacagg 


aaacttgtat 


65640 


ccatgactcc 


tgaactctca 


caaatattag 


65700 


tacagncacg 


gaagtgcagg 


aacatttcac 


65760 


tatagctgta 


aacactggaa 


taaggaaggg 


65820 


gaaaccgttg 


atgggactga 


gaaaccagag 


65880 


agctggaacc 


aacgggcaca 


ggtaggtaat 


65940 


cttgccgtga 


cctcattaga 


ggagtaattt 


66000 


acactagggc 


cttttggaaa 


tagagtgggc 


66060 


tgtccgctgt 


attctaattg 


cataaactca 


66120 


atgggagcag 


agctcttgtc 


tcttcaaatg 


66180 


gtgctttgag 


aatggtgccg 


tgaaaaggga 


66240 


acctacttgg 


tctttatgct 


aaaaagtggg 


66300 


gagtccaaga 


cttgttctca 


taccactgca 


66360 


agacacaggc 


tggcaatgcc 


aggtgggaaa 


66420 


atgcaaaggc 


ttcagactcc 


aggaaagctg 


66480 


ctgtgtgaac 


cagggtgaac 


aataagggta 


66540 


ggccacacat 


atatgcaata 


ggtatattat 


66600 


ttttcttatt 


tattacatgt 


ttttactaac 


66660 


aaaagcactg 


cagggtaaaa 

Zj ZJ ZJ 


attaagaatg 


66720 


gtgctaataa 


atagtattat 


taaacaagaa 


66780 


aaaattcatc 


aaatatatgg 


caatacagtt 


66840 


tgttgcatfat 


tgcaaatttt 


gtgtgaccat 


66900 
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ttgcctctgg acattgagac ctctcaagat 
ttatctataa tggactgtgt caaacattat 

5 

ggttttcact gttccatact atcagtttct 
attggcaaag actgagttac tgtttttgga 
10 gagctaaatc tattttttct tacatagttg 

ccaatgagac catcatagtg ctcccatcaa 
ccgaacccac caaccagggg caggatagcc 

15 

ttattggggt aaatctaatt cagaacgtgt 
tgatatatgt cttgggactg gacatctgtc 
20 agggacttta gggtagaagc cacttggaga 

tgtcttaaaa atatattttc ccttattctg 
tttgtatttt tacaatacag tgcttttcta 

25 

ataatgacat attgtgtgta tgtgtatata 
tgactttttg caaataattt ttccctcact 
30 cctcttttct cctaacgttt ttgagaatgg 

ctcctctcac ccagaattta ttaaagaagc 
ggatctatct tggattggga gggaaacaca 

35 

ttatgtgatc tgaggtgagg cactctgttt 
ttccttgcct catgtacaat cacatgggtt 
40 aaaacttttt ggtaaatgat aatagagctg 

aactacctcc taagacagag cttcagttat 
ccctgtgtta gtctgctaag ggggatggcc 

45 

aacagaacaa tgcacactga ggcctttcag 
aggaaaaata actaatggat actaggctta 
50 aaacccccat gacacaagtt tatctatgta 

aataaaagct aaaaataaaa gagcttccag 
ttccgaggaa gtgaccaaag gggcaaactc 

55 

agggccttct gcactgccaa tactgcagag 
tcagagagat agaaagagag ttaactagtc 
60 gtaaatctca gcacttgctg tgagctggaa 



tcatgttgct 


gccaccctgt 


gatgtgttta 


66960 


ggaaaaatcc 


tcatgaaacc 


aagattgtgt 


67020 


ttctctaatg 


gacaggccaa 


gcctagacta 


67080 


gtcagaacct 


gggagctctg 


tactttttaa 


67140 


gcaacaccat 


catgacatca 


caacctgttc 


67200 


atgtcatcaa 


cttctcccaa 


gcagagaaac 


67260 


tgaagaaaca 


tctacacgca 


gaaattaaag 


67320 


tggagagggg 


ttgggggaag 


tgccaagaga 


67380 


gtgagtgtgg 


agaccctaaa 


attttgctag 


67440 


aaactgtccc 


agaacttttc 


cacaagaggt 


67500 


aatatgagga 


attgattttc 


tattgttgtc 


67560 


tattttcttt 


tacaaaagta 


atggtggtaa 


67620 


ttccacaact 


gcatgcttaa 


> 

taatccaaac 


67680 


gcaaattaga 


ggaaatatac 


aactcctttt 


67740 


aaatgatatt 


caccttttgt 


ttgtctgttt 


67800 


tgtattgcat 


gaagggagga 


gagaaagaag 


67860 


gagttttggt 


ttctagaatt 


gcctctgcac 


67920 


gactggattt 


caggacctgg 


tgggactggt 


67980 


tctatgaggc 


tcatgtgaca 


taaggtctag 


68040 


tgaatctctg 


aggtggatgt 


caggaaggaa 


68100 


acatgagaac 


atgcagtatt 


tggtttctgt 


68160 


tctaaatgat 


gagaacacat 


ggacacgtag 


68220 


agggtggagg 


gtgggaggag 


agagaggatc 


68280 


atacctgggt 


gatgaaataa 


tctgtacagc 


68340 


acagacatgc 


agttgtaccc 


ttgaacttaa 


68400 


agcttcagct 


atagggttgt 


gaactgcagt 


68460 


catgtccagt 


ggagtctggg 


agatgagggg 


68520 


aatgtggggg 


ctttcagatg 


tcccctgttt 


68580 


atggacttag 


aggtcctgag 


agctgggaaa 


68640 


gcaagtctca 


tgagcagacc 


agggcctgag 


68700 
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aggcagaaat gcagaagcct tcctgagcac 
ggcagagctg cacaccctcc aggatgtctt 
5 ggtttgtatc ctcattagga tcttgcagaa 

caaaaaggcc ctgtctttac aaatgctggt 
cagactatcc agatcttgtg tggcatgatg 

10 

gcttccttct ctccaaattt tacccaagtg 
ttcataggac ccttttttgt gagtagagtt 
15 gaggaagatg ccaatagctt agacttccca 

gagcaaggag tcaacggtga atcttgttct 
attttagatc aatttcttat cagactcaaa 

20 

ttcacattat catcttgagc tatgatgaag 
aaaaaatcca tgaattagga taaagttggg 
25 gcataaatgg aagacatcac acactcacag 

gtggggatca tggaggaagg agaacttaca 
cattgactga ttcccaggta aggctgagca 

30 

tcctctgaaa tagctgcaag tctggagttt 
aactgctttc tctaagatgg gggctttctc 
35 ttccttcaac ttcctgcaat tcagtagcta 

caccaagaag tccacctttt tgtgtatctt 
tacacttgga tttggatttc attggttttc 

40 

atgccaagct actacattac actacattac 
taggctgtct aagattttcc ttttccttga 
45 gagattttta tagaatgttc ctaaatttgg 

agtatggtga tgaatttttg ggtggtgatg 
tatcaagggt catgctacca acataatgtc 

50 

ggggtagtat tttccagatt tctcctctgt 
ttttatgtcc ttccctagag aacatcttac 
55 aaaaattatt tcctggcttt taaggtcttt 

aggatatatt tctgctatct tttttttttc 
ggacaaatct atgtatttaa atggaactta 

60 

tcccatgaga gttgtctata aacattgtga 
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ctgccagaca 


cctgagactg 


tggcttctgt 


68760 


ttcatatttc 


tcactccaga 


gtttccacag 


68820 


ggcgataatg 


cattagtgcc 


atgggagcgg 


68880 


tttcatcctc 


aatctcactt 


tctcttccca 


68940 


gtattgagct 


tggggatcat 


tttggcatct 


69000 


acttctacac 


tgttgaactc 


tgcttaccca 


69060 


tctgaggagg 


gcaggatggg 


gcaaagaggg 


69120 


cctgccagct 


tgctatgttt 


gatctgccag 


69180 


cctgtcgggt 


aggatgacag 


gggttgcttg 


69240 


taaacatttc 


ttttgaagat 


catcttattc 


69300 


ctagtgactt 


ctctccaggt 


ttaggcgaaa 


69360 


aaggaacatt 


ttatacaaaa 


aaaaaaaaga 


69420 


agaaaccaca 


ctgttatccc 


ccccatgctt 


69480 


cctctgagca 


ttttgaaaat 


acttttactg 


69540 


gatatctatg 


cccagtgatt 


gacaggaaat 


69600 


caaaaggact 


tagtagtcct 


ttcctcagag 


69660 


ataggtaagt 


ttccctttca 


tagatgtttt 


69720 


tttgtttttc 


ttacatttgt 


gtatttgatg 


69780 


gtccattttt 


cttgtcttat 


taactaaagt 


69840 


cccaaatgac 


ctttttctgt 


tccaagatcc 


69900 


attttgtcat 


gatgtctcct 


cagggtcctc 


69960 


gaactttgat 


agttttgggt 


agcgctggct 


70020 


gtttgtctga 


tttttttttt 


tgatgtttag 


70080 


atgaggtaaa 


gcaccatttt 


catcaagtcg 


70140 


actgacacaa 


ttaaccttga 


ttatgtggtt 


70200 


aaaattacta 


cttttttttt 


taaccattgc 


70260 


accaaaactg 


tctcatttta 


ttaaaaaaac 


70320 


ttttqtttaa 


tgggatattt 


tttgtttatt 


70380 


tgatgttatc 


tgaatgttat 


atggacaaag 


70440 


cctcattgat 


cactggtttc 


tcagggattt 


70500 


tagtgacgtc 


gaaggtactc 


tgtacatgga 


70560 
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atctgacctg 


accattttct 


cttttatctt 


5 


ccacagagaa 


aaggttaacc 


aagcttttgg 




tggatcaaaa 


taaaacttag 


acagaactct 




y ci ci a ci v_» t.^c*.^j 


htrraaaara 




10 


tgtatccttg 


ttcagagaac 


atttctccaa 




aggggattat 


aatattccct 


ctgcccatgc 


15 


gtttatgagg 


ttctttcggt 


gataactggc 




gctgacagtg 


ttataaagcg 


gtctattggt 




aarrrttaaa 


aaaaaaat era 


crt caaacrtat 


20 


gaggctacta 


cagtatacta 


cattacattt 




ctttctcaga 


ttttcctttt 


ccttgaggac 


25 


gtatcctttc 


atctatctca 


ccagaagtat 




ttgcacccta 


aaaagttgtc 


cttagtcatt 




araacaaaaa 


acaaaac t t t 


accccrt c t tt 


30 


aagtggcagg 


gagacaaagc 


ctggattata 




attaagtctt 


cgctcaatcc 


attatctctc 


35 


aatgtcatac 


catgeaattg 


caccgagact 




ccaaagccag 


caaggaagtg 


agaaaeggea 




oo t - a t - 1" haaa 

y ' — j O C*. V— k— U CA. d 


aattattatt 


taaaaccctc 


40 


tgctgtttaa 


tttaactttc 


tttgaggatg 




aaatgaaatg 


taactttcta 


tgatgatgga 


45 


cagtcacatg 


tggctatcga 


tgatgtgaaa 




attttacttt 


aaatatagat 


agtctatgtg 






■hcjpffcfccaaa 

1— w vA. C4. 


cttttaaaaa 


50 


aatcccacag 


tgaacatgea 


ggaagctcaa 




agacttacca 


gagacatcga 


ggctttagcg 


55 


cctttttgag 


attgaagaag 


agtgccatct 




tagactctag 


gcatcaagaa 


tactactggc 




tcattatgtt 


gggaaattgg 


ggtcccagag 


60 


tgcccatgtg 


ggattaagta 


acattctact 
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agtttatcat 


ctctggctct 


ctatcaatcg 


70620 


tgagtaagag 


agctataacc 


cacaacccaa 


70680 


ttaaggctag 


tctttctact 


ttgaaaagtt 


70740 


gttagaatta 


ggagctgggc 


ctggatcctc 


70800 


ttattgtacc 


ctacatcaaa 


gtctgeatte 


70860 


cgaagaatgt 


atcacagaga 


aattgtgcct 


70920 


cttcaaattc 


aggttttcag 


tggcaaggaa 


70980 


tggggtccat 


tetttaagee 


caggtgttac 


71040 


tgttcatgtg 


aggtatccaa 


agtagacaca 


71100 


aggectgatg 


tctcctcagg 


ttcctttaga 


71160 


ttcaatagtt 


ttgggtagtg 


ctggctgact 


71220 


aatactttta 


ttttgtttga 


gtataaattc 


71280 


tgtattggct 


aacaaaaaaa 


caaaacaaca 


71340 


atcccttatt 


ccagcaaaaa 


ctagagttgg 


71400 


gggagaatct 


tctcttgtct 


ttaaagtttc 


71460 


aaggggttga 


tgttggagat 


tatgtgagag 


71520 


caatttggaa 


gctctggcta 


caaaaatctc 


71580 


tccagaaaaa 


tgecaattet 


ttttctcatc 


71640 


agttagactc 


ggaactacta 


actagatctg 


71700 


gaaatgaaat 


gtaactttct 


atgatgatag 


71760 


aatgeactae 


agctgtactg 


tacaatatgg 


71820 


tatagtgcaa 


ctgaataaaa 


atttctctta 


71880 


gctattggca 


accatactga 


gtagcctggc 


71940 


gaaatatatg 


ggcctgttac 


tccagagaac 


72000 


ggatgcagga 


tgtggtctcg 


gggtgggctc 


72060 


agttctactc 


ttgtcacaca 


ttgcaccagc 


72120 


caagacaaat 


ctagcttatt 


ctaagaaata 


72180 


tactaggagg 


cccaggagtg 


tacacttcag 


72240 


tatcaactgg 


tgagttccta 


aaagaagaag 


72300 


gtatggctgt 


gaggtgtgcc 


ctggagccta 


72360 
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cctacctgta actatagctc tcagagtact 
ggagtacaaa agaagtcact tctgaggtta 
5 ataaatgaga accagagtgg gaaataaaat 

aaggcaagtc tagtactgtg acttcaatat 
acttcccgct gcctcccaca cattacacca 

10 

cagccacccc atactgcact cagtgtggcc 
gcctgagact cagaaccacc ccctcactca 
15 atgggcaggc gagagtgagc tattaggttg 

aaccacaatt acttttgcac catgctaata 
ttccgccagg ggtttttgag ttgagcgaga 

20 

cactttgggt cacgctctct tggtaatgtt 
ctgtcctggg gaaagaattt aggagagtaa 
25 taaggtgaac cagttcgtat agtcattcat 

atttttgctg gacactggag aggcatacag 
gggacagata agggaataac ttacaattta 

30 

cattagcaat gatccttacc caacatttct 
cattctgagt gctctgtctg ccctggtggg 
35 cttaaatcct gcctcactgc agtgtgagtt 

tgtttcttac ttttatcatg attcacttta 
v tctggctgta agtattttta gatgggatgt 

40 

atttcttgtt attccttctt ttgaaaatct 
gaaagccagg tttatgtaaa tcaaagggga 
45 ctaatagagt ggaattgaaa atgttaaata 

tttgtagagg ttcttaatta tcaaactaat 
tcaaaatgag aatcttggta catggagtgg 

50 

cccaggagat cagaataggc caacaaggaa 
tggtttgaga agacatcaga ttcagtacta 
55 aagataaggg cttcttcttc taatgatcac 

gcaacattat tgttgctttt tgttatagct 
taagaaatat agggagatgg agagtgtaat 

60 

ttggaagaag aaaaataatg aagtttggct 



rttaactcaa 


ttctcttttc 


tatatgaqtt 


72420 


crt~fct~at"cfcafc 


tcccttcact 


tcataaatqt 


72480 


caaaaaacta 


attacccaaq 


ggaataaacc 


72540 


tctcracctcc 


acrt tcacracc 


tttttcttgt 


72600 


tCCttaQQCC 

W** Vp^ >tip* 


tqcacc tacra 


cctctgccca 


72660 


aat ca t atcrc 


ctqcccatqt 


tccaqqqcct 


72720 


t c t t taaaaa 


a to tcrc tt tt 


qaqaqaaqca 


72780 


crtcrcaaacat 


aat tQtcrcrt t 


ttatqqcaaa 


72840 


rataatctat 


cttttccatcr 

^BP* ^pP> Kp* ^p* ^p^ ^p-* ^p^ Vp»« 


tqqctqatqc 


72900 


aaaaaac tcrc 


atctacrtaccr 


t tgccacctt 


72960 


acaaataaaa 


cctcatqaaa 


gccctgctat 


73020 


cattctctac 


cacaccactg 


aggtcctget 


73080 


t caaccaaaa 


attaccctqq 


agattgagag 


73140 


caataaataa 


tcaatcccao 

^p»> W*L fppy*L ^pp* Xpp* ^pi* 


acctcctqaq 


73200 


ataacaaaaa 


tqcataacaq 


attcatgagt 


73260 


c tt tcaaata 


cataqcaqcc 


tqqttqqaaq 


73320 


tttcattatc 


ctqtctqtca 


aacaggccac 


73380 


aaacaaaaat 


aatataccaa 


caagaagtta 


73440 


taccacQQac 


tqctatacaq 


ccaaagccag 


73500 


t ctaat ct ta 


tcraQCftat t c 


aaaagccttg 


73560 


ctctattQQt 


tctciqtttqcr 


qcatctqgqg 


73620 


ctacaqqqat 


qaqattaaqq 


ggattacata 


73680 


aaqttqatqa 


tgaaaaac ta 


aaacttactg 

•ppp* 


73740 


ccaattttqq 


aggcataaat 


tatgagttta 


73800 


taaattaaaa 


atqqaaacaq 


aacaaaaaaa 


73860 


acaatatqtq 


toe at aat aa 


qqaqttaatt 


73920 


aaaaaaaoaq 


aataaaaaat 


atqqqatqaq 


73980 


aataatgaat 


aatactgatc 


aggtgagaat 


74040 


gaaagaagtg 


tgatatatta 


aatcaggaga 


74100 


tgagaatgat 


tgcaggagaa 


ttctatctct 


74160 


aatagaaata 


atatgtctga 


agttgaaaga 


74220 
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ttaaggggca tgtgcatggt cagcaacaca 
agggacggct gggtcagtgg ctcaccccta 

5 

atggatcacg aggtcaggtg ttcaagatga 
actaaaagta caaaaactag ctgggtgtgg 
10 gaggctgagg cggagaatta cctgagcctg 

tcacgccact gcactccagc ctgggtgaca 
aaggaataaa gaaattgcag aatatttgtt 

15 

agaactcagc tggaagatgg acagatagaa 
ctcattataa agagggcctt gctcccaagg 
20 atgaggctga cagcaattgg cacagattaa 

aaattctgaa gagaaatttc agagaaattt 
acgattttac attggaaata agagagattc 

25 

gtaatggaaa aagttatatc ttctctttaa 
atgtgatcaa attatttccg tttattatca 
30 ggacagaaat gatagaattc attcttgtga 

tttactacct tcactgtttt tttctaagcc 
aaagaaaaaa caacaaagaa acaacaacaa 

35 

tacatagaca gggatgtcaa tactaaattt 
tgatccttac aaatttctga gaaagaaatg 
40 aggagctaga gaaagagaga gacttgtcca 

agactggaac ggagctgctg attctcagtc 
accgagtgaa caatgtcatc agaactatgg 

45 

gaagtagatt agttgattta attactattc 
taaggggttg aattaagaca attgtgaggt 
50 tgtcctttct tcattctcta atggttgagg 

taggcatgaa ggcttcagct gacctaagca 
gaactctctc tctgatgctg atttgcactc 

55 

ctgtgctgcg gtggaaacag gcttactctg 
tcacttaacc ttgcctagtg tatcttatcc 
60 ggtagaagga acaatcagtc agtcatgaga 
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ttgagagaaa 


aacaggcaac 


ataagagtaa 


742 8 0 


taatacaagc 


attttgggag 


gccctggtgc 


^ A m 1 A f\ 

74 34 0 


gcctggccat 


gacggtgaaa 


ccccatctct 


74400 


tggcgggcag 


ccgtaatccc 


agctacttgg 


74460 


ggaggtggcg 


gaggttgcag 


tgagctgaga 


74520 


gagcaagact 


ccgtctcaaa 


aaaaaaaaaa 


74580 


gtcctgcaat 


cataaaaaca 


agataagcca 


74640 


aggtctggtc 


aaagtcatgt 


ttgtatagca 


74700 


taatggtata 


atgagaaaaa 


gtacaagtaa 


74760 


tacctggcac 


agccaattcc 


accttgggaa 


74 820 


ggtagtgata 


aagtataaat 


tcatgatggg 


74 880 


tgtcaagttt 


gcttgctgga 


acaatttcca 


74 940 


ctgtgtataa 


ataccacaaa 


ccaaagtttc 


75000 


gatggccagt 


tctgcacaaa 


gaatgtctca 


75060 


gttggtgttg 


gtgattagca 


actctgacag 


75120 


ttacagttac 


acccccaaaa 


aaaagaaaaa 


75180 


aaatcaagaa 


gaaatacaaa 


ggggttcaga 


75240 


tacaaaggac 


ttgcacagtc 


aaaacctcct 


753 00 


ttagtatgca 


tgtttgatac 


atgaggaaac 


75360 


ggcttacaca 


gctactaagc 


agtaaaggtg 


75420 


ctgggagtta 


ttacgccact 


ctgctttctt 


75480 


atgtctcaaa 


gaagtaagaa 


ataagtgtat 


75540 


atctcccttt 


ctcctctcaa 


ggaaatgagt 


75600 


gtgggaggag 


cagttttatc 


ttcctgatgg 


75660 


tggtcttcat 


ctctatggtc 


acctctcaac 


75720 


acatcaatct 


tttatttctt 


gactttcagg 


75780 


tgctggaatt 


ctgcctagct 


gtgctcactg 


75840 


acttccctgg 


ggtgagtgtg 


ctggccggct 


75900 


ctgcactgtg 


ttgagtatgt 


caccaagagt 


75960 


tacacatggg 


agggcatttg 


cattgtgatg 


76020 
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gaagacagag aagaaaagca gatggcaatt 
tatgaaaata gttaatcatg agaaatcaac 
5 ggaatgagac tctgggaagt taaatgactg 

tgctgaggac actagaacct ggcttgcctc 
ccactagtct ctttaatagt attgacttgg 

10 

agtgaatgtc ttatgctctg catttgcccc 
aagctatatt aaaaatgtat tgttgaatca 
15 tgaattaaaa agccttgatt tatccaaaat 

gtgataaatt gcaattattt gaagaacctg 
aagaattcta tgtatacagt tggaaaaatg 

20 

aagtctgaag acctaacata ttttgtttct 
acattggtaa ttctggcatg tcctcaaaaa 
25 tgacttctta agaaaaaagg gagaaatatt 

gaaaagttaa ccattataga aaagcaaagc 
aatgaacatt aaaaaaaacc attatttcac 

30 

ctcttgattt gcctgacatt gacttcagca 
agattagcca aatagtctga atttccagaa 
35 tttcccagaa aatgtttccc agaaaactag 

agcaaaggac acaacttttt attgtacccc 
tgtggtgctc agactttttt atacaggtgc 

40 

tagtgcataa aggcaagccc catataatga 
caaaataaaa tttttgagtg catattttgt 
45 aaacacagaa taagtcaaca atgacagaca 

catcaaatat tcaaacaaaa ctaatgagca 
tatgaagcat cgaccagttc atacttattt 

50 

tataaatatt aagtttcagc atgtgtgtac 
tacatgagct ccatttcatt gaaagaactg 
55 gcaggggtca tacccactgt atcagatatc 

tccaggctga aactggactg tcagattctt 
cattatgtct tctccctgtc cattgcaccc 

60 

tttacctact gtggacattc aacttttatt 
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gagtagctga 


taagctgaaa 


attcactgga 


76080 


tgagtcaatc 


ttcctatttt 


gtcagcgaag 


76140 


gcctggcatt 


atgctatgag 


tgtgtgcctt 


76200 


ccttataagc 


agaaacaatt 


tctgccacaa 


76260 


taaagggcat 


ttacacacgt 


aactggatcc 


76320 


tggtgatctt 


aaaattcgtt 


tgccttttta 


76380 


aacccctatg 


gacttatggc 


tttatttaac 


76440 


tgtattatag 


agtgtagaat 


gaatactagg 


76500 


gtgatatgct 


ctacttatct 


tggattagct 


76560 


gcatatatac 


atctatcttg 


aacctgattg 


76620 


tctagagtgt 


acttttcctg 


cctcacagtt 


76680 


tgactcatga 


ctgtggatat 


gaagaactat 


76740 


aatcagaaag 


ttgattctta 


tgataatatg 


76800 


ttgagtttcc 


taaatgtaag 


cttttaaagt 


76860 


tgtcatttaa 


gatatgtgtt 


cattggggat 


76920 


aaagcacggg 


gctgtaaatt 


accatttact 


76980 


aacaaggcag 


aatgatcatt 


cccagaaaca 


77040 


acagcatgat 


cattcaatgg 


atcacagtga 


77100 


ttaattgtca 


acaggagtta 


actgatttgt 


77160 


tagtgtttta 


tcctatgtat 


tttaactcat 


77220 


agtctcaggg 


tatatgaaag 


tagctggctt 


77280 


tttaacactc 


gttttgttgt 


tttttccctt 


77340 


taaacccaaa 


ttggaactcc 


attttctccc 


77400 


aacaaaagct 


aaccaggaat 


actcattttt 


77460 


tcttattgca 


agaaagacca 


attgtttgtg 


77520 


acacacacac 


gtagaaaagt 


atttcaattt 


77580 


gtcatgtttc 


ccttgctttt 


acccactgta 


77640 


>_» ^ 1m» w*- \^ \^ C-A 
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tattttcttt 


77700 


tgattttagc 


tttatcttca 


aactctcgtt 


77760 


cccactgaca 


cacaccattt 


cttttattta 


77820 


gacgcttatt 


ggataccaga 


cactatgtca 


77880 
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aataagagaa acagagatga cagtgaaatt 
tgaacacaga ataaagcata ccttcttctg 

5 

gtgtgtgtgt gtgtttgtgt gtgtgtatgt 
aatccacatc ccccatatca atcacatggt 
10 ctcagagtcc acaattggtg tattcatcta 

cccctgccac catgtcttca ccaccaccac 
tacccagtca aacttctggt cttcaaagtt 

15 

tttatcagag aaggcatcca gccctttagg 
tcaacctcac tgccatgtca tccactgata 
20 gcccaagtca agtgagaatg atgcctacct 

gtaaagcagt taccaatata gctgacacat 
gtgtgtttac ttccctttgc tttcacttcc 

25 

caccctctat gtctaaaatt aacttcaacc 
tatactcatg aaatctttca tttcttcaag 
30 ctgtaacccc agcactttgg gcggttgagg 

gacaagcctg gagaccatga ccaaacccta 
caggtgtggt gacacatgcc tgtagtccca 

35 

cctgaaactg agaggttgag gctcattgag 
tggacagagc aagaccttga aaaaacaaaa 
40 aagaaagttg tagaatgctc taaataggat 

cttccctttg caggagaaag attttagaat 
actctactga caccttatgg tgataacgag 

45 

aaggcaaaac aaggtatctg gaattcactg 
acttgctacc tttgtccttt acagtgtctc 
50 tttttcaaaa gttatttatt attgtaagga 

aaggtaaaca tccggtatgg tatgacacag 
gcattatgta accatttact ataatctgtt 

55 

tctgtctgtg agtaatataa ctgcactgct 
cacgttccaa ctgcctacaa tccaagtgtt 
60 tcccttcaga acccagctca ggttggaacc 



agggatatat 


acagggcact 


aactattatt 


77940 


cctaaaagaa 


tgtaggggat 


aaacaggtgt 


78000 


gtatgtgtga 


gagagagtcc 


ttcttttgaa 


78060 


gccaccttct 


gccacaggag 


aggacaatga 


78120 


ctgtactcac 


actccagctt 


tgttcctcac 


78180 


agcactctgg 


ataccagaag 


tcttactgat 


78240 


ccccccgggt 


ctctcttcag 


ctttgacttg 


78300 


caacactcaa 


agccactttt 


tatcctaggc 


78360 


tcttttcttt 


caagtagcaa 


tgccccttca 


78420 


tgagatagaa 


gatgtattga 


gataaccttt 


78480 


gaaagatgct 


cagtgaactc 


gagttttatt 


78540 


atagtccaac 


tttccttccc 


cttccacttc 


78600 


ccaatacatc 


gctgggattt 


ggaacctgat 


78660 


aaagtctggg 


ccaggcacag 


tggctcttgc 


78720 


cgggcagatt 


gcttgagttc 


aggagtttga 


78780 


tctctataaa 


aaaaaaaaaa 


aaaaattagc 


78840 


gctactcaag 


aggctgaggc 


aggaaaatca 


78900 


ccatgatctt 


gccactgcac 


tctagcttgg 


78960 


aaaaaggaaa 


gagaggaaaa 


aaaagagaga 


79020 


gactcagact 


gtgtaggaaa 


catgttttct 


79080 


tattggcctt 


• 

tcccaatttc 


tgcacagttg 


79140 


gaacaacttt 


tctcccaaga 


gaaatagaaa 


79200 


gaggtatcta 


acttgaccac 


aggaaatcac 


79260 


cacatgtcat 


cagtgagggg 


aaactatgca 


79320 


aagtggctgt 


gcttcagtca 


ggagtaggcc 


79380 


cgggtttgga 


gcgcaggtgc 


acaaccccat 


79440 


tgtgtgagct 


catacctggc 


tttgagccac 


79500 


gactctgtag 


gacaggagag 


agaataaagc 


79560 


ctttcagcta 


cccaccacgt 


gtccaccagc 


79620 


tgtcaattgg 


catagtcagt 


aggatcccaa 


79680 
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gatgagtgag cccttggcct caatgatccc 
tggtacctga tggtagctgt gctgctagga 
5 gacaggtctc ctgtgagcat ggagaaggca 

aggacaatac ctttgctggc atagttcaat 
acactcagtc cctgcaggat gcagcaaagg 

10 

agaggcccag acacacagct tggaatgaga 
cccaccatcc cagtcagaga ctcctgctca 
15 gcgagtttgc ccagtggtcc atcagaaaat 

ttgggcttgg ggacccccta ccatgacaga 
tatattgtgg gaattagata aacagtgtca 

20 

gatgcctcat ctttgggatg agggagccaa 
gaagttggct tccattatga ctcatccctc 
25 gttagcacag ggcaaggcga ccacacccta 

atatggaaag atgccaaaga aataccagaa 
ttagtgcatg taatttggga gatgggtgtg 

30 

gccaaatgat gaacacttta cctcccacat 
gagtgccttt ggttccctgg ccactgtcct 
35 tgactacttc catggcggcc ctcagggaag 

gtgccataaa aaaggggaaa gtttccctac 
atccccagta ggtgacctgc acacagatgt 

40 

gaaagaaaat tgataagcaa ctccatgaag 
cagagcagca attctggaaa atgcctaaga 
45 gagtgtccca cccagacact ccagctcaag 

cctttttttt ttttttcgag agagacggag 
tggcgtgatc ttggctcact gcaagctctg 

50 

agcctcccga gtagctggga ctacaggcat 
ttttagtaga gacggggttt caccgtgtta 
55 gatccgccca tctcagcctc ccaaagtgct 

gctgttcttg attatttaaa aaatttaata 
atttaaaata cattttctta ttgctttcag 

60 

tatttctggg atgtgaaatt ggttcaacat 
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aggtcagcca 


tgtggcccca 


gcatgtgttg 


79740 


tgggccttgg 


tggaaacatg 


ggtggtggtg 


79800 


ctgaagtacc 


tgtaagcaca 


gagcaatgag 


79860 


gggcgttttt 


gactgtacta 


cggtaagtgc 


79920 


taggggacct 


ccaggcacaa 


acagaaaccc 


79980 


attagagtct 


gccattagtg 


tgggcttggg 


80040 


gtctgatact 


gaggaagaag 


aacatcagtt 


80100 


agagcaggaa 


cagtcattgg 


ggccccaggg 


80160 


gcacatgtct 


tacagtgcct 


ataccccaat 


80220 


gcagtgcttg 


ggggaacccc 


tacctgcctc 


80280 


cagcatcgtc 


tgctctgcct 


ctgaaatgga 


80340 


cctccgtcag 


tgattgcagg 


tgagcaggtg 


80400 


attgaatggc 


tgatggtggc 


catttgaacg 


80460 


actgtgagta 


aatggcagtc 


gtatctagat 


80520 


tggcaggctg 


ttccatctga 


ctacccaagg 


80580 


gagggacttt 


gtgttgggct 


ctgtgcccct 


80640 


cactctatgt 


ggggtgctgc 


atacataagg 


80700 


cagagggctg 


taggtgagac 


cagagagtct 


80760 


tccagggggc 


tacctcttgg 


gacaaaatgc 


80820 


agattgattt 


gattttggct 


ggggttgcgt 


80880 


tactgttaac 


tttgtggagg 


caattgtccc 


80940 


gggagaagga 


tgatgcatcc 


tgcagggact 


81000 


gactactaga 


agatgggtgg 


aggtataaag 


81060 


tctcactctg 


tcacccaggc 


tggagtgcaa 


81120 


cctcccgggt 


tcacaccatt 


ctccttcctc 


81180 


ctgccactgc 


gcccagctaa 


ttttttgtat 


81240 


gccaggatgg 


tctcaatctc 


ctgacctcgt 


81300 


qqqataacaq 


qcqtqaqcca 


ccacgcccag 


81360 


tctcaatacc 


aattatacta 


tttactgtac 


81420 


aagtttaatt 


ttattatttt 


tggtagggtt 


81480 


atacaaatca 


ataaatatga 


ttcaccacat 


81540 
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aaacagaatt aaaagcaaaa accatatgac 
taaaattcac catccttcat gttaaaaatc 

5 

actcaaaata ataagagcca tctatgacaa 
actggcaaaa gcaggcagca ttccccctga 
10 catcacttct attcagcata gtactggaag 

aaataaaagg catccaaata gaaagagagg 
aacgtgattc tatacttata aaatcctaaa 

15 

aacaacttca gtaaagtttc aagatacaaa 
tataccaata atgttcaagc tgaaagccaa 
20 caaaaagcat aacataccaa gagtagagct 

gaactacaaa acactgctga aagaaatcag 
atgctcatgg atagaaataa tcaatattgt 

25 

ttaaaaaatt gtatatatcg atttttatca 
taacaaagtc tttttttctt atttgatgtg 
30 ctattgtcaa attatatatg aataactgga 

ttaaaaatac atcttcctat tctatttgcc 
tatatgaggg agagtgattt ataatttctc 

35 

tgtgcctgta acaaattacc acaagctttg 
gttctgtggg tcaggataca acactggtct 
40 actactttcc tctctggagc ttctagggaa 

cttgaggctg tctcattcat tgactcatgg 
gtccttctca tgttgcacct ctgactattc 

45 

tgggaaaggt tctccactta taaggactct 
caggataaac tctgtgttaa gatcaactgc 
50 aaaatcattt gccagatatc atagcatact 

ctttgggtac tattattctt ggtaacatac 
tagaagaaaa ataacaactg tgtcctggcc 

55 

catgtggtga ggcaatgagt cataaccaac 
ggtgtgacat gagctgatga aaaagaactt 
60 acaaatttgg aagttttaaa atggtaccct 
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atctcaatag 


atgcagaaaa 


aggtttttga 


81600 


cctcaacaaa 


ttaggcatca 


aaggaacata 


81660 


acccacagcc 


aacagccagc 


atcatactga 


81720 


gaactggaaa 


aaggcatgga 


tgattattct 


81780 


ttctagccag 


agcagtcagg 


caagagaaag 


81840 


aaggaagtca 


aattgtctct 


cttcacagac 


81900 


gactccagca 


aaaggctcct 


agaaatcatt 


81960 


atcaatgtag 


aaccattact 


agcatttcta 


82 020 


attaagaatg 


caatctcatt 


tacaatgcta 


82080 


aaccaagtag 


gtgaaagatc 


tttacaataa 


82140 


agacgacatg 


aacaaatgaa 


aaaacagttc 


822 00 


taaaatggcc 


atacttctcc 


atgaatttct 


82260 


aaagctttgt 


ctgcatacct 


ctgttgagat 


82320 


tttctgcagt 


aagtgatttt 


cgaagatttt 


82380 


atgagttcaa 


tttggccatg 


accaaatgct 


82440 


acttttattt 


ttcagtttct 


tacatctata 


82500 


attatcacac 


tgcttatatt 


agttttctat 


82560 


tgacttaaaa 


caatacaatt 


ttatcttaga 


82620 


cactgggcta 


aaaatcaagt 


tatctacaga 


82680 


aaaaatccat 


gaccttgcct 


tttctagctc 


82740 


gctttttcaa 


ggctggcagt 


ggcagatcat 


82800 


ttctatcctc 


aattctctct 


tagacttacc 


82860 


tattattaca 


ttagaattgc 


cataattctc 


82920 


ttagcaaact 


tgattctatc 


tgtaactttt 


jr-v. ^i. jr-v a j-^ 

82 980 


ctcagattcc 


agggagtagg 


atgtggataa 


83040 


tgcctttttt 


gatgttgtta 


ttgcagaaaa 


83100 


gacactggct 


gggagctcat 


tgtcagtaga 


83160 


gtgggtttct 


ttttgtccca 


tgaaagattt 


83220 


aattcaagta 


ataaatttat 


ttctattttc 


83280 


gtttcctggt 


tccttgagaa 


gttgtggggt 


83340 
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tgaaaggtgg gaaatcaaat gcataatcag catcagcatt acattaaaga gtacacgtgc 83400 

aggtttgtta catgggtaca ttacatgatg ctgaggcttg aggtcccaaa aatcccatca 83460 

cccaggcagt aagcatagta cccaacaggg gctcttcaac ctgttcccct ctccctccct 83520 

tccccatcga gtgagtggtc ctcaatgtct attgttccca tctttacatt catatgtatt 83580 

caatatttag ctcccacttg ctagtgaaaa caatgcagta tttggttttc tgttcttccg 83640 

ttcatttgct taggataatg gcctccagtt ccatccatgt tgctgccaag gacaagcttt 83700 

cattgttttt tatgatcgca tagtattcca tgttttatat gtaccacaat tcctttatcc 83760 

aatccaccat ggacaggccc ctaggttgat tccatgtctt tgctatcatg agtagcactg 83 82 0 

caatgaacat atgtgtgcat gtgtcttttt gatagaatga attattttcc tttgggaaaa 83 880 

cccagtaaca ggattgctga gtcaaatggt agttgtgttt ttaggtcttc taggaatcac 83 940 

cacactgctt tccacagtgg ctgaaccagt ttccattccc accaacagtg tgtaagtgtt 84000 

ctcttttctc cacagccttg tcggcatctg ttgttctgtg actttttagt gatagccttt 84060 

atgactggtg taagatggta tcttattgtg gttttgattt gcatttctct gatgattcat 84120 

gatgttgaac attttttcat atttgttggt tgcttgtatg tcttcttttg aaaagtgtct 84180 

gtgcatgtcc ttttcccatt tttaattgga tttttttttt tttttttttt ggggtttggg 84240 

gtaaaaat 84248 

<210> 78 

<211> 38736 

<212> DNA 

<213> Homo Sapiens 



<220> 

<2 21-> genomic_DNA 

<222> (1) . . (38736) 

<223> n is an undetermined nucleotide (dATP , dCTP, dGTP , or dTTP) 
<220> 

<221> init ial_coding_region 

<222> (7631) . . (7729) 
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<220> 

<221> coding_region 
<222> (10476) . . (10682) 



10 

<221> coding_region 
<222> (12601) . . (12687) 

15 

<400> 78 

nccattcatt ccaccatgtg aggacacagt 
20 taccatcttg gaagcaagca acgtgaccct 

cgtggacttc tcagttccaa aactgtgaga 
tttgtggtat tctgttatag cagcacaaat 

25 

ggagtactgc tgtatcgaac acctaaaaat 
tagaggctag tctctaacag acgattctgg 
30 gaacctcagt cttagagatt attttagtgg 

cagtgagtgc cattctgatg tggtgtcaga 
gggccaggcg cagtggttca atgcttgtaa 

35 

agattactgg aggtcaggag ttcaagacca 
accaaaaata caaaaattag cctgtcatgg 
40 gaggctgagg ctgtagaatt gcttgaacct 

gcaccactac actccagcct gggtgacaaa 
aaaaaaaaag tggatcaaag gccatccttg 

45 

gtgtctatgt cccagggttt tgtggaaggc 
ggtgaaataa atatctaagt gaagtgttca 
50 aagagagaaa tgaattaaaa gataaaatgt 

ttagaaaatt atcagcctgg ccatgtgaag 
ggtatggcca agctactgtt taaataagaa 

55 

attcactaga acagtggagg ggtgatccca 
ttcccattac aggtgccaag tgctaagttt 
60 atacacacac acacacacac acacacacac 



ttgtcctatc tggaagatgc agcaacaaga 60 

aagtagatgc tgaacctgtg gcaccttgat 120 

aataattttc tattttttat aattacccag 180 

ggactaagaa agaaactggt accaataagt 24 0 

gtggaagtgg cattggaact gggtaatggg 3 00 

taaggactaa gaagaagaga gctgtagaca 3 60 

ttgtaattac aatatcaatt aaaatgtggg 420 

tgaaaatgag aaataattaa aaactggagg 480 

tcccattcac tttgggaggc cgaggtgggc 54 0 

gtttggctga tatggtgaaa cctcgtctct 600 

tggtgcacac ctataatccc agctactcgg 660 

gggaggcgga agttgccatg agccaagatc 72 0 

gtgagactcc atctcaaaaa aaaaaaaaaa 780 

ttatcaagtg gcaaagaata tggctgaatt 840 

agaacttaaa ggcaatgaac taggatgttt 900 

gggaactgac tgcttataat aaaatacggg 960 

ataatcaaaa agtaagtgaa atttaaagat 1020 

ataaaaggct gttcaagaaa gaaaattaaa 1080 

atatatatgt gtaggtagaa gccagatact 1140 

gtgatatttg gaagatcttt ggggctccca 1200 

ttattgacag atgaatggat aaactgtgag 1260 

acactcatat aattcagact taaaaaagga 1320 
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gatcttgcca tttgccacaa catggaagat 
ccagacacag agacatattg cacaatctca 
5 acaactaaca atatagaatt cagagagata 

agaagagaga gagagagaaa actggtaggg 
aagataagta ggatgagcaa atctagagtt 

10 

aacaaggtat tgtatttggc atttttgcta 
catagaaata caaacgagtg actatttgaa 
15 ttattatact ttaagttctg ggatacatgt 

cacgtgccat ggtggtttgt tgcacccatc 
aatgctatcc ctccccttgc cccccacccc 

20 

ccctgtgccc atatattctc gttgttcaat 
tggttttctg ttcctgtgtt agtttgctga 
25 ctgaaaagga catgaactca ttctttttta 

gccacatttt ctttatccag tctatcattg 
tattgtgaat agtgctgcga taaacatacg 

30 

ataatccttt cagtatatac ccagtaatgg 
taggtccttg aggaatcacc acactgtctt 
35 ccaacagtgt aaaacaattc ctatttctcc 

ctttttaatg atcactattc taattggcat 
atttctctaa ttaccagtga tgatgagctt 

40 

tcttcttatg agaattgtct gttcatatcc 
gatagaatat cgttcagtca cccaggctgg 
45 acctctgcct gctgggttca agtgattctc 

caggtgcata ccagcacacc tgactaattt 
cgtgttagcc aggctggttt tgatctcctg 

50 

aagtgctggg attacaggcg tgagccaccg 
gggttgtttg tttttttctt gtaagtttgt 
55 cctttgtcag atggatagat tgcaaaaatt 

ctgatgatag tttcttttgc tgtgtagaag 
attttggctc tttttttttt tttttttttt 

60 

ggctggagtg cagtggcgcg atctcggctc 
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actggagaaa 


cttatgctaa 


gtaaacttac 


1380 


tttatatgtg 


gaatctaaaa 


taacaacaaa 


1440 


gagagtgaaa 


ctgtggttac 


caggggtaga 


1500 


gagagcagca 


ggtcaaaaga 


tacaaagtga 


1560 


ctaatgtaca 


gtacaaagac 


gatagttaat 


1620 


aatgactaga 


ttttagctgc 


tcttgccaca 


1680 


ataatagata 


tgttaatttt 


taaattttta 


1740 


gcagaacgtg 


caggttgtta 


cacaggtata 


1800 


aacccatcgt 


ctacattagg 


tatttctcct 


1860 


cagacaggcc 


ttggtgtgtg 


atgttcccct 


1920 


tcccacttat 


gagtgagaat 


atgcagtgtt 


1980 


gaatgatggt 


ttccagcttt 


atccatgtcc 


2040 


tggctgcata 


gtattccata 


gtatatatgt 


2100 


atgggcattt 


gggttggttg 


aagtctttgc 


2160 


tgtgtatgtg 


tctatatagc 


agaatgattt 


2220 


gattgctggg 


tcaaatggta 


tttcaagttc 


2280 


ccacaatggt 


tgaattaatt 


tacactccca 


2340 


acatcctctc 


cagcacctgt 


tgtttcctga 


2400 


gagatggtat 


ctattgtggt 


tttgatttgc 


2460 


tttttcctat 


gtttgttagc 


catataaatt 


2520 


ttcacccact 


tttttttttt 


ttttttttga 


2580 


actgctgtgg 


cacaatctca 


gctcactgct 


2640 


ctgcctcagc 


ctcctgagta 


gctgagacta 


2700 


gttgtatttt 


tagtagagat 


ggggtttcac 


2760 


acctcatgat 


ccatccgtct 


tggcctccta 


2820 


cgcccggcct 


ccttcaccca 


ctttttgatg 


2880 


ttaagttcct 


tgtagattct 


ggatattagc 


2940 


ttctcccatt 


c tat aQQt ccr 


cctcjt tcact 


3 000 


ctcttttgtt 


taattagatc 


ccatttgtta 


3060 


tttttaggcg 


gagtctcact 


cttttgccca 


3120 


actgcaagct 


ccgcctcccg 


ggttcacacc 


3180 
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attctcctgc ctcagcctcc cgagtagctg 
aatttctttt gtatttttag tagagacggg 

5 

tctgacctcg tgatccgccc acctcggcct 
accgcgcctg gccaattttg gcttttgttg 
10 ttgcctatgc ctatgtcctg aatggtattg 

ttaggtctta tgtttgaatc tttaatccat 
aagtggccca gttttagttt tctgcatatg 

15 

aatatggaac actttcccga ttgcttgtct 
taaatgtgtg gtgttatttc tgaggactct 
20 tggtaccagt accatgctgt tttggttact 

gtaaccattt tattctctat atgtatctca 
caatgaaatt tattttacaa aatgtaagtg 

25 

cttaagacat gtcccctaag aataatattc 
cccaaaattt attaatataa taataatcca 
30 tattccctaa ttataaaaat atgaggataa 

aacacataaa ttatggttca ctcatacatt 
tctactcata gagcagtttc tcaccatttt 

35 

cttttatgac actgactttt gtttttgttt 
gccccatatc ttggacattt cttcttgttt 
40 aaacacaaca tagatgatgt gacttaccaa 

aataacagtt tatttcaatt ttggtggtgg 
atcagttgaa tgtttatgtc ttcccgaaat 

45 

atggtattaa aaagtggggc ctttgcaagg 
aatgggattg gtgctcttat aaaagagacc 
50 gagaggacac agcaagaaga cagacatcct 

gaggatatgc cagcaatttg acgtcagatg 
gtttgttttt tcagccacct agattatggt 

55 

gacaggtggt attcacatgc tattttcatt 
aaaggaatca gtagtcttat gcttttaaga 
60 ttcactgaat ggtgttagta tccattaaat 
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ggatacaggc 


acccaccacc 


acgcccagct 


3240 


gtttcactgt 


gttagccagg 


atggtctcga 


3300 


cccaaagtgc 


tgggattaca 


ggcataagcc 


3360 


caattgcttt 


tggttttatt 


catgaagtct 


3420 


cctaggtttt 


cttctatggt 


ttttatggtt 


3480 


cttgagttaa 


tttttgaata 


agctgtaagg 


3540 


gctagccagt 


tttcccaacg 


tcatttatta 


3600 


ttgtcaggtt 


tgtcaaagat 


cagatggctg 


3660 


gttctgttcc 


attggtctat 


atatcttttt 


3720 


gtagccttgt 


aatatagttt 


gaagtcagta 


3780 


taatatcatg 


ttgtatactt 


taaatatgta 


3840 


cttgatatcc 


tgatacttaa 


tgtctaaata 


3900 


tactatatag 


ctttaataca 


attattacat 


3960 


atatatagtt 


cttattacaa 


tttcctccat 


4020 


tggaggggct 


tatgtgttta 


tatataaata 


4080 


tggttctctt 


gtcctttgag 


tctttcataa 


4140 


ttgtttgttt 


gattttgcat 


ttattttagt 


4200 


ttttagtctg 


gagcatttgt 


ttcatataat 


4260 


tccatgagtt 


caagttaata 


attttaataa 


4320 


cttttcattg 


catcacataa 


ggatgaacat 


4380 


ttactttaat 


caattgattc 


gggtggtgtt 


4440 


tcatatgtta 


aaattctaac 


ccctaaagtg 


4500 


cgataaggta 


acaaaggtgg 


agtcctcatg 


4560 


ccacagagct 


ctcttgctcc 


tttcgaccat 


4620 


tgaaccagaa 


agttggcctt 


catcacactt 


4680 


tcctggcctc 


caaaatcgtg 


agaaataaat 


4740 


atttttgtta 


taacagtctg 


aacagacaaa 


4800 


ttaaaggtat 


atttttaact 


tcataactaa 


4860 


caaagtgaat 


attatgtttc 


aaaaaatatt 


4920 


ttgtcatctc 


atcctctatt 


ctcagcctaa 


4980 
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atgtattagt ctactttatt actcaagtta 
gtataaaagt tttgcatttc ctgggaacac 
5 tcctaattat gtgtcaatgc tcattattca 

ctctttggac ccaaaacaac tatctgagtt 
tcccatctgc atctcagctt tcatttccca 

10 

aatcttctat tccctaagga catctgtcct 
tggcatggtt taagtcaatg tttccccatg 
15 gattcagaga accatcaacc ttgtcaaaac 

aatcataatt tggacagaga tatttcaaga 
taccaaataa tatatgtaag acacaaagag 

20 

taacaaataa gattttatct tgttaaaaca 
agatgactct ttctagctgt tgattcctcc 
25 gcttaattgt cactttgagg aaaatgtttc 

tccttcatta taggaagaaa cttctcactt 
aatagcttat gctgaacatt agcccttcct 

30 

accaggctgt gttttcaact gaggacttcc 
attacaagaa agattaaaag tcttgcaaac 
35 tttaagactt tcttagtact tttcttaaga 

gagttagagg aatatagtaa tggaggaagg 
ttttgttttc cagcaatcca aagaacaata 

40 

agaaagatat atcaaaagag aagagttcct 
tggaggggga aacacgataa ataagagaag 
45 cctggtcctg aaggagaggg ttggagagtg 

tgggaaacag ggctagctga gaaacaaaaa 
aaggccttct tattctctaa ggagggatcc 

50 

tgcccatgga cctgcaggaa caagagatat 
atattgttag agaaacacca aaatttatct 
55 tgtttctgag ctctgccacg ggaacttggg 

agacgcatca aaatgagtag aaaaagtcgt 
ctgatacagt gagactcgat ctggagaaaa 

60 

gcaacaggag tgggccatga gtccagcatt 
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caactgaaag 


acggaaactc 


ggatatcaag 


. 5040 


ataattgtag 


taaactctta 


gataaattga 


5100 


tgaggaagtt 


agagaataag 


aaggccttaa 


5160 


tttgtttctc 


agctagtcct 


gtttcccagg 


5220 


ctctccaact 


ctctccttca 


cgaccagggg 


5280 


tctcttattt 


gatcttatgt 


cagtttgaca 


5340 


tagctgaggt 


gctaagtgca 


aatattaact 


5400 


cataaaactg 


ctctttatta 


gttgacccaa 


5460 


cattttcaga 


tataaaccat 


aaaacaaata 


5520 


aaagcaattg 


atccgaggct 


actatgtata 


5580 


cccgcatgcc 


cgcataaatc 


agtcagaacc 


5640 


tcctcacaaa 


gagcccacac 


gcttcatttt 


5700 


ttcctttctt 


tccttcacac 


ttccctcttt 


5760 


acatatgaga 


aatcacagtc 


acttttgtga 


5820 


tctctggatc 


ccttgaatgc 


tggaatagtt 


5880 


tgttgcccaa 


tactaatgta 


tccaatgcag 


5940 


acatgaattt 


ttttctttct 


atcccaagca 


6000 


taatatatgc 


aaagggagag 


ataagtggag 


6060 


aaaaggagat 


tttgcttact 


agtttctagc 


6120 


tctcactaaa 


aagtttatga 


tacatgcttt 


6180 


agagtacagg 


gtatgagagg 


tatcatggaa 


6240 


gacacatgtc 


cttagggaat 


agaagattcc 


6300 


ggagatgaaa 


gctgagatgc 


agatgggacc 


6360 


ctcggatagt 


tgttttgggt 


ccaaagagtt 


6420 


ctagaactag 


aaatgccagg 


gagacagttt ' 


6480 


tttatggaga 


atgtgcacag 


agagttgtat 


6540 


ccaactcctt 


tgttgtcaaa 


cataaatatc 


6600 


aacac 1 1 cacj 


cr t q aQ t Q c acr 


acrccaQQaaa 


6660 


ggcattttac 


tcgtcctggc 


ctctttcacc 


6720 


ccctgaagtc 


ccagcttctc 


ccttagggaa 


6780 


ctgggtttcc 


agcggggttg 


cccaaggcac 


6840 
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tggtttctgt 


cttacatgac 


teagagtgea 


5 


gaacaaaggc 


aagcaccctg 


tttcagcact 


agggggagaa 


agaggttaca 


ggctccagac 




^ ■}- ,3 r* 1 ,a f~* .sa /*» £3 


^ a. Cl *— j v-^ w Q I— u 




10 


caataaaggt 


cttccaatgt 


atgaagecac 




ttttcaaata 


ttaaaagcac 


tgcaggataa 


15 


aaacatatct 


aagatgcatg 


ctaatagaca 




aaaaattcat 


caaatatatg 


gcaatacagt 
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atcrfccracccfc 


20 


tcatgctgct 


gccaccctgt 


gatgtgttta 




ggaaaaatcc 


tcatgatacc 


aaggttgtgt 


25 


ttctctaaca 


cacaggccaa 


gtctggacta 




gccagaactt 


gggagctctg 


tacttttaag 




caacaccatt 


atgacatcac 


aacctatttc 


30 


tgtcatcaac 


ttctcccaag 


cagagaaacc 




gaagaaacgt 


ctacaggcaa 


aagtcaaagt 


35 


ggagag^ggt 


taggggaagt 


gggaagagat 


gagtgtggag 


acccttaagt 


tttgctggag 






aacttttcca 


caaaaoatta 


40 


tatgaggagt 


tgattttctg 


ttgttgtctt 




ttttctttta 


cgaaagtaat 


ggtggtaaat 


45 


tttcacaact 


geatgettaa 


caatccacac 




gcaaattaca 


gaaaaaatat 


acctcctctc 




GL GL U~ 7-j CI CL L^> 


taccttttat 


ttatctattt 


50 


tgtattgeat 


gaagggagga 


gagaaagaag 




gagttttggt 


ttctagaatt 


gcctctgcac 


55 


gactggattt 


caggacctgg 


tgggactggt 




tctatgaggc 


tcaggtgaca 


aaaggtctag 




tgaatctctg 


aggtggatgt 


caggaaggaa 


60 


acatgagaac 


atgeagtatt 


tggttttgtt 



gagtttggat 


gccatgttgg 


gggcctctga 


6900 


agggagattg 


tagtactgaa 


ggcggacacc 


6960 


aagaaacaag 


caaacctctt 


taactgggaa 


7020 


agggccccta 


gaatctctag 


etatgetaac 


7080 


tctgtaatga 


ctggaggagg 


ttgggtgtat 


7140 


aaactaagag 


catgtctagt 


aaaatatgat 


7200 


aaaaataaat 


gattagagtg 


catttgtcta 


7260 


tgttgtcagt 


aggactgggc 


aaagggacat 


7320 


ttgcctctgg 


atattgggac 


ctctcaagat 


7380 


ttatctataa 


tggactgtgt 


caaacattat 


7440 


ggctttcact 


gctccataat 


atcagtttct 


7500 


attggcagaa 


actgacttac 


cagttttgea 


7560 


agctaaatct 


attttttctt 


ccgtagttgg 


7620 


caatgagacc 


ateataatge 


tcccatcaaa 


7680 


cgaacccacc 


aaccaggggc 


aggatagect 


7740 


tattggggta 


aatctaattc 


agaacatgtt 


7800 


gatgtatgtt 


ttgggactgg 


tcatccttgt 


7860 


ggactttggg 


gtagaageca 


cttggagaaa 


7920 


tcttagaaac 


gttttttccc 


ttattctgaa 


7980 


tgtatttttg 


caatacagta 


cttctctata 


8040 


aatgacatat 


tgtatataca 


tttttctgta 


8100 


tgattctttg 


caaggaattt 


ttcccttact 


8160 


cctgttttct 


tcttaccttt 


ttgagaatgg 


8220 


ctcctctcat 


ccagagttta 


ttaaagaagc 


8280 


ggatctatct 


tggattggga 


gggaaacaca 


8340 


ttatgtgatc 


tgaggtgagg 


cactctgttt 


8400 


ttccttgcct 


catgtacaat 


catatgggtt 


8460 


gaaacttctt 


ggtaaatgat 


aatagagctg 


8520 


aactacctcc 


taagacagag 


cttcagttat 


8580 


agtttgctaa 


gggtggtggc 


ctctaaatga 


8640 



-198- 



WO 02/062946 

tgagaacaca tggacacgta gaagggaaca 
ggtgggagga gggagaggat caggaaaaat 
5 tctggctaca aaaatctccc aaagccagca 

ccagttcttt ctttccctcg ttggtattta 
tccaacacta gatctgtgct gtgcaatgta 

10 

tctatgatga tggaaatcca caacagctgc 
atcgatgatg tgaaatatgg atagtgcaac 
15 aaatataagt agcccatgtg gctactggca 

tgttttcaaa cttttagtag gaaatatatg 
tgaacatgca ggaagctcaa ggatgcagga 

20 

gagacatcga ggctttagtg agttctactc 
attgaagaag agtgccatct caagacaaat 
25 gcatcaagac tactactggc tactaggagg 

gggaaattgg ggtcccagag catcaactgg 
ggattaaata acactctact gtatggctgt 

30 

acaatagctt tcagaatgct gtgagctcag 
aagtcacttc tgaggtcagt ttatttattt 
35 cagagtggga aataaaacca aggaactgag 

tattgtgacc tcaattttct gacatccagt 
tcccacacat tacaccatcc ttgggcccgc 

40 

ctgcgctcaa tgtggccagt catatgcctg 
aaccaccccc tcactcatct ttgagaggtg 
45 agtgagctat taggttggtg caaaggtaat 

acccaaaatt acttttgcac catgctaata 
ttgcacgagg ggtttctgag ttggccgaga 

50 

tgcttgtggt cacactcaca gtaattttcc 
ccagggaaag aatttaggag agtaacattc 
55 ggaaccagtt catatgttca ttcattcagc 

ggctggacac tggagaggcc tacaacaaga 
cagataaggg aataacttgc aatgtaatgg 

60 

ttgggaatga ttcttaccca gcatgtctct 
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atgcacactg 


gggcctttca 


gagggtggag 


8700 


aactaatgca 


cagaaactca 


atttggaagc 


8760 


aggaagtcag 


agagagtgcc 


cagaaaaatg 


8820 


gagattattg 


tttggaaccc 


tcagttacaa 


8880 


actgtctatg 


atggaaatga 


aatgtaactt 


8940 


actgttacaa 


tatggcggtc 


acatgtggct 


9000 


tgaattttaa 


attttactta 


attttacttc 


9060 


accatactga 


gtagcctagt 


acaagagtaa 


9120 


ggcctcttac 


tccagagaac 


aatcccacag 


9180 


tgtggtctcg 


gggtgggctc 


agactcacca 


9240 


ttgtcacaca 


ttgtaccagc 


cctttttgag 


9300 


ctagcttatt 


ctaagaaata 


tagactctag 


9360 


cccaggagtg 


tacacttcag 


tcattatgtt 


9420 


tgagttccta 


aaagaagaag 


tgcccatatg 


9480 


gggatgtgcc 


ctggagccta 


cctacctgta 


9540 


ttctcttttc 


tgtgtgagtt 


ggagtaaagg 


9600 


cctccacttc 


ataaatgtag 


aaatgagaac 


9660 


tacccaaggg 


aacaagctgg 


gtcaagtcta 


9720 


ccaggccttt 


attctgtacc 


tcccgctgcc 


9780 


acctagacct 


ctgcccacag 


ccaccccatg 


9840 


cccaggttcc 


agggcctgcc 


tgagactcag 


9900 


tgcttttgag 


agaggcaata 


ggcacgcaaa 


9960 


tgcggttttg 


ccattgaaag 


tagtggcaaa 


10020 


caccatccag 


attttctgtg 


tggttgatgc 


10080 


gagaaactgc 


atctggtaca 


ttgccacctt 


10140 


aggcaagcct 


cgtgggccct 


gctatctgtc 


10200 


tctaccacac 


tactgaggtc 


ctgtttgagg 


10260 


caaattttac 


at t acacia tt 


CtQaqt tt tt 


10320 


aatgatccct 


ccgggacttc 


ctgagggaga 


10380 


caaaaaggga 


taagagagat 


tcgtggccct 


10440 


ttcaggtgca 


tagcagcctg 


gctggaagca 


10500 
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ttctgagtgc 


tctgtctgcc 


ctggtgggtt 


tcattctcct 


gtctgtcaac 


ccggctgcat 


10560 


5 


taaatcctgc 


ctcattgcag 


tgtaagttgg 


acgaaaagga 


tataccaacc 


agacttcttc 


10620 


tttcttatga 


ttatcattca 


ccttacacca 


tggactgcca 


tagagccaaa 


gccagtctgg 


10680 




ctgtaagtat 


tttttagatg 


ggatgttcta 


attttatgag 


gtttctgaaa 


gccttgattt 


10740 


10 


cttattattc 


catcctttga 


aaatctctct 


gttaattctg 


gtttgggcat 


ctggaggaag 


10800 




gccaaggtta 


tgtaaatcaa 


aggggggctg 


catggatgag 


attaagggga 


ttacatacta 


10860 


15 


acagagtgga 


attgaaatgt 


cagataaggt 


tgatgatgaa 


aaactaaaac 


ttacggtttg 


10920 


tagagtttct 


taattactca 


actaatctaa 


ttttggaggc 


ataaattatg 


agtttgtcaa 


10980 




actqaqaatcr 


ttqatacata 


qaqtqqtqaa 


ttcaqattQQ 


caaacagagc 


agaaagaccc 


11040 


20 


gggtgatcag 


aaaatgctga 


caaggaaatg 


tggtatgtgt 


gtgtggtagg 


gggtcagttt 


11100 




ggtctaaaaa 


gacatcagac 


taaagatgaa 


gaaacagaga 


ataggaaata 


tggggttaga 


11160 


25 


tggtaagggt 


ttctcctaat 


gattacaata 


atgaataata 


ctaatcaggt 


gaaacggtaa 


1122 0 




cattattatt 


gctttttgtt 


atagctgaaa 


gaagtgtgat 


atcttaaatc 


aggagataag 


11280 




aaatataaaa 


aciatcicracia.Q 


tqtaactqaq 


aattatcgca 


ggagaattct 


ctctctttgg 


11340 


30 


aagaagaaaa 


acaatgaagt 


ttggctaaca 


gaaataatgt 


gtctgaagtt 


gaaaaattaa 


11400 




ggggtatgtg 


catggccagc 


aacgcattga 


gaggaaaaca 


ggcaacataa 


aagtaaagaa 


11460 


35 


gttgcagaat 


atttattgtc 


ctgcaatcat 


aaaaatgagg 


tcagccaaga 


actcacctgg 


1152 0 




acgatggaca 


gatagaaagg 


tctggtcaaa 


ttcatgcttg 


tacagcaccc 


attataaact 


11580 




aaaccttact 


t ccaacrcrt a a 


tgqtataatcr 


aggaaaaagt 


gcatgcaaat 


gaqqctgaca 


11640 


40 


gtaattggta 


caaattccac 


cttgggaaaa 


attctggaga 


gaaatttggg 


agaaatttgg 


11700 




tagtgataaa 


gtataaattc 


atgatgggac 


gattttatat 


tggaaataag 


agagattctg 


11760 


45 


ttgagcttgc 


ttgctggaac 


aatttctagg 


taatggacac 


aacatgtatc 


ttctatttaa 


11820 




ctgtgtgtaa 


ataccacaaa 


acaaaagttc 


atgtgatcaa 


attacctccc 


tttattatca 


11880 




aataaccacrt 


tc tQcacaaa 


qaatqtctca 


ggacagaaat 


ggtagaactc 


atccttgtga 


11940 


50 


gttggtgttg 


gtgattagca 


acagtttact 


a'ccttcactg 


tttttttcca 


agtcttaaaa 


12000 




caccaaaaac 


agcaaaaata 


aaaaataaac 


aacaaaaata 


aaaaacatcc 


aaacaggttc 


12060 


55 


agatacatag 


acagggatgt 


caatactgaa 


ttttacaaag 


ggcttacaca 


gccaaaacct 


12120 




ccttgatcct 


cacaaatctc 


tgagaaataa 


atgttagtat 


gcatgtttga 


tacatgagga 


12180 




aacaggagct 


agagaaagaa 


agagacttgt 


ccaggcttac 


acagctactc 


agcagtaaag 


12240 


60 


atgagactat 


aatggagctg 


ctgattctca 


gtcctgggag 


ctattacgcc 


actctgctct 


12300 
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cttactgagt 


gaacaatgtc 


atcagaacta 


cgtacgtctc 


aaagaagtga 


gaaacaagtg 


12360 


tatgaaatgg 


attagttgat 


ttagttacta 


ctcatcttcc 


cttctcttct 


caagaaactg 


12420 


aatgagtgaa 


ggagttgaat 


taagacaatt 


gtgaggagga 


gcagttttgt 


tttcctgatt 


12480 


gtttccttac 


ttcattctct 


aatggttgag 


gtggtcttca 


tctctatggt 


cacctctaaa 


12540 


ctaggcatga 


aagcttcggc 


tgacccaagc 


atcactgata 


ttttatttct 


tgactttcag 


12600 


ggaactctgt 


ctctgatgct 


ggtttctact 


gtgttggagt 


tctgcctagc 


tgtgctcact 


12660 


gctgtgctgc 


agtggaaaca 


gactgtctga 


cttccctggg 


tgagtgtgct 


ggccagcctc 


12720 


gcttaatctt 


gcctagtgta 


tcttatctct 


gcactgtgtt 


gagtatgtca 


ccaagagtgg 


12780 


tagaaggaac 


aatcagtcag 


tcatgagata 


cacatgggag 


gtcatttaaa 


tggtgatgga 


12840 


agaccgagaa 


gaaaagcaga 


tggtaattga 


gtagctgaca 


agctgagagt 


tcactggata 


12900 


tgaaaatagt 


taaaatgaga 


aatcaactgg 


ttcaatcttc 


ccattttgtc 


attgaaggga 


12960 


atgagactct 


gggaagttaa 


atgactgggc 


tggcattatg 


atatgagtct 


gtacctgtgc 


13020 


tgaggacgct 


agaacctggc 


ttgcctccct 


tctaagcaga 


aacaatttct 


gccacaacca 


13080 


ctagtctctt 


taatagtatt 


gacttggtaa 


agggccttta 


cacatgtgac 


tggatccagt 


13140 


gaatgtctta 


tgctctgctt 


ttcctcctgg 


tgattttaaa 


attctttttc 


cttttaaaag 


13200 


atatattaaa 


aatgtattgt 


tgaatcaaat 


tccaagggat 


accaccccta 


tggacttatt 


13260 


gctttattta 


actgaattta 


aaagccttga 


tttatccaga 


attttattat 


agagaataga 


13320 


agaatactag 


actgataaat 


tgtaattatt 


tgaagaaccc 


agtgatctgc 


tctacttgtc 


13380 


ttggattagc 


taagaattct 


atgtatacag 


ttggaaaaaa 


tggcatatcc 


acatgtatct 


13440 


tgaacctgag 


taaagtctga 


agacctaaca 


tactttgttt 


cttctaaagt 


gtgcttttcc 


13500 


tgcctcacag 


ttactcatgt 


cctcaaagac 


gactcatgat 


gctggatatg 


aagaactatt 


13560 


gacttcttgg 


gaaaaaacgg 


agaaatatta 


attggaaagt 


tgatttttat 


gataatatgg 


13620 


aaaacctaac 


cattataaaa 


aagcaaactt 


gagtttccta 


aatgtaagca 


tttaaagtaa 


13680 


atgcatattt 


gttttaaaaa 


attatttcgc 


tgtcatttat 


gatatgtgtt 


cattggggat 


13740 


ctcttggttt 


gcctgatact 


gacttcagca 


aaagcacagg 


gctgtaaatt 


accatttact 


13800 


agactagcca 


aatagtctta 


gacatttccc 


agaaaactag 


gcagaatgat 


gattcaatgg 


13860 


atcacagtga 


ggcaaaggat 


acagcttttt 


cttatactcc 


tcaattgtga 


acaggagtta 


13920 


ac tctat t fccrc 




aqac tct tt t 


at cacaQQtQ 


ctactQattt 


atcccatgta 


13980 


ttttactcat 


tagtgcataa 


aggcaaaccc 


cataatgaag 


tctccgagtg 


tatgaaagta 


14040 


gccggcttca 


aaataaaatc 


tttgagtgca 


tactttattt 


taatacttat 


tttgttattt 


14100 


tttcctttaa 


acacagaata 


aatcaacaat 


gacagacata 


aacccaaatt 


ttaactcaat 


14160 
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tttctcctca 


ccaatattca 


aacaaaacta 


acgagtgaac 


aaatgctaac 


caggaatgct 


14220 


5 


tatctgttat 


gaagagacaa 


ccagttcata 


cttacgtcct 


tatttcaaaa 


taaaaaagac 


14280 


gaattgtttg 


agtataacaa 


acataagcct 


caccttgctc 


atgcacacac 


acgcacacac 


14340 




acacacacac 


gtgtaaaaaa 


gtatttcaat 


tttatatgac 


ctccatttct 


ttgaaagatt 


14400 


10 


ttgtcatatt 


tccttgctct 


gaccctctgt 


agcaggggtc 


atacttcata 


ctcactgtct 


14460 




cagatatcct 


gatactcaga 


gagtttctct 


gctctttttt 


tccagtctca 


aatggaactg 


14520 


15 


tcagtttctt 


tgattttagc 


tttatcttca 


aatctgtttc 


attaggtctt 


ctccctgtcc 


14580 


tttgcacccc 


ccactgacac 


acaccatttc 


ttttatttat 


ttgcctattg 


tggacattca 


14640 




acttt tattq 


atacttattg 


gataccagac 


actatgcttt 


taatgccagg 


gctaccaaag 


14700 


20 


tgaataagat 


acatgatttg 


gcttcaagat 


atttatagtc 


aaataaggaa 


aacagagatg 


14760 




acaatgaagt 


tagggatttc 


gctaactgtt 


atttgagcac 


agaataaaga 


cactaactgt 


14 82 0 


25 


tatttgagca 


cagaataaag 


aatatcatct 


tctgcctaaa 


agaatgtaga 


ggataaacaa 


14880 




gtgtgtgtgt 


gtgtgtgtgt 


gtgtgtgtgt 


gtgtgagaga 


gagagagaga 


gagagagggg 


14940 




QQcrcrcraQaQa 


aagacagaaa 


agacagaatg 


tacttcctt t 


gaaaatccac 


ctccccacta 


15000 


30 


ttaatcacat 


ggtgccacct 


tccgccacag 


gagaggacca 


ttgaagcatt 


gtcatttgtc 


15060 




tggggtaata 


tcagaggttc 


attgtctcat 


gccaagaaaa 


ttgaagacat 


ggacacacaa 


15120 


35 


agagtgaatt 


taagagcaga 


agtttaatgg 


ctaaaagaaa 


gaagagtgct 


tcctcatgca 


15180 




gaggaaggga 


tcccgaatgg 


atttctgggt 


ttggggcaag 


atgaggctgg 


ttttatagat 


15240 




craat ctcraoo 
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tgatttacat 


caqqcataqt 


agattggttq 


gaccagatgt 


15300 


40 


gccatttaca 


tagcatggga 


agtgctggcc 


atcccacact 


aatcttttat 


tatgcagatg 


15360 




gggtctccac 


ctggccggca 


ccatattgcc 


tgcttttttg 


ctgcacacat 


ggtgacaaag 


15420 


45 


aaaagggaag 


agggaacctc 


catgcttaat 


atatctagct 


tccaggtatc 


ccttttctat 


15480 




tggcacaact 


gctggcattc 


acctttgcaa 


gcttccagct 


tgcttatcta 


ttcttgcagc 


15540 




tcroatt tt tc 


* 

acrQC ttct 1 1 


tqc taQacaa 


gaaatgattt 


qqqqqctqct 


ttttaataaa 


15600 


50 


aagaaaatct 


tactgagaac 


tcccttaccc 


tcactagcta 


cttaaataat 


ttctttttag 


15660 




ctcctgtgtt 


atttctcccc 


tcagaagtag 


taaccctaac 


tgctgttaga 


gggtgttgga 


15720 


55 


cagtgactct 


ttctgactac 


ttcctgctga 


aaaggggcat 


ctagtgggga 


cagcagctag 


15780 




ggctcctcct 


gagttcaatc 


taacggtcct 


cagaagaaag 


gcgtgtccgt 


gtgtggttct 


15840 




gcctgcagca 


ccatttaaag 


tttgcttcta 


ggccaaaaga 


gataaatttt 


acaagaaggt 


15900 


60 


ttagaataca 


gggtttgaat 


atgagtatta 


agattaccat 


tattagtggg 


ggtaccatag 


15960 
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gccataacca tgccagtgga gtttgatacc 
gtttgcctcc accaggtgtc gcttacttta 
5 ttttgcataa gtgtccccca attattaatt 

ttacagcaat tagaattttc caaccggaat 
gtgttgcaaa tagtagagcg catgtatcaa 

10 

acgtctaaaa cttttactcg ccaagatgta 
aaatgtaatc ccatgtataa ttaaaatctc 
15 atatctggct gcgaatctcg agaggaaatg 

aaggtgggac tgagtaagat gggtagccct 
gcttaagatc tcctattttt tgtcattgat 

20 

ttgctctctc agctttccag gctttgactg 
taacttttac tgctgagggg gttgaaagaa 
25 ggaagagaga gagaagggag agctttcacc 

gatcctattt cttggagttg ggcttttgct 
agagaggtta catgctcaac caactcagag 

30 

aggaaagacc atccatacag cattttaaaa 
gaggtattta agtagaccac agcggggagt 
35 gatgcctttt ctgtatggca tgaaaatgct 

accaggagat attggatgcc cttcatcttt 
tcccctggat agcttcctat tctttgggtt 

40 

ttatttttaa gacagatttc acaagcatta 
tctcctgaaa acaatctctg ggcacattga 
45 tgatggagga ttttaaggac tttccattgg 

tgactgtagc catcctgaga gctggaaaat 
tgcagggaag tactgaggtt taatttcttt 

50 

agtccataga tgtcttgagg cttccttgct 
tttccttcag ctacttcatc ttttcccttt 
55 tttcatggaa ggaaaactga ggatgataac 

tgttttgttt tttttgtttg tttttttgtt 
tcactctgtg gcccaggctg gttgcagtgg 

60 

cctgggttca cgccattctc ctgcctcagc 
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17760 


ctcccgagta 
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caggtgtgcg 
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ccaccacacc aggctaattt tttgtatttt 
aggatgttct cgatctcctg atcttgtgat 

5 

attacaggct tgagccaccg cgccctgccc 
agtaagaaag tgtctttcct tccaaatggc 
10 cttggagtgt aaatattacc tacctttatt 

atcagttcag ctaattgact gcttgtggct 
tttagagtgg ctactgcata tcctgactgt 

15 

gtctagtctg gattttctag gagagtttcc 
attataactt gttcacagtc atattcaggt 
20 gtgtttaggt gagaacaagt gtttaactca 

taaggagctg gctggctgtt agccaaaggc 
tatgtggggt gtaaatcatt tcccagggtt 

25 

actgtggcaa tggcttggag gcatgtataa 
gttgagaaaa atattagatt agaatttttc 
30 aagaggggag gcctggatta gacaggagaa 

taccttcctt cctttaattt ccagaatcac 
gagccactgg agctggaggt ttgagcccca 

35 

agactggttc tgaaactggt gacttctgtc 
tgctacagtc tggacagggt caaggtgact 
40 ccctgacttg ccacatcaat agcatctagc 

agcctgcaaa acagctactg gaggctctgt 
ggcatcctcc tgtccctatt taaaagacca 

45 

tgctatctgg tcctatcacc tgctgctgta 
taataaactt gttctttagg atgagccatc 
50 ttttattagt gtatctctca gcctttccat 

gtctatcatg gacagtttag aataattaaa 
taatatacac attaaaagtg tttcctttac 

55 

tcagggttgg caaaaagaac tgcttccctt 
tcaccttctc tgtctccttt tctccttttc 
60 ccaaatttct ctgctagctg cagagctgcc 
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19260 


aaaggctgca 
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cagaactatt 
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19440 


cctatttgga 


atggtgtttc 


cattattcct 
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ttgttcatct 
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agaagcagca taacatccct ccatgcgagg 
tctatatacc tattggggtc atcagaaaat 
5 cctgaaatga gcaggaaatg tgaatcctaa 

ggggtaagag tgaagcaggg aaagtgggta 
gtggcatggt tagtgggggt cccagataag 

10 

ctgtctcaga ttgtccccct ggaatatgct 
ttcctgacat ggctgctaaa agagctaggt 
15 tcttgcaggg caaagaaagc ctgtacatag 

tagaaaagat ctaattgttg gataatatta 
ctgttcatcc ccaagatggt aagaagttgt 

20 

aagtctcagg gtcagaggag tcccaatgtt 
aaagacaatc tgttatgcat ctagaaagag 
25 ttagtctcat tccttttggt atgacacagg 

ctactgtttt ctctccttgc ttcctgtggt 
gttgcaggca taacccccca ccccagccat 

30 

tcatgcatac ccatgcagcc ctagttctcc 
tgtatgatct gcatggctcc ccaaaagatg 
35 tttgggcaag actgctttaa tgggaggggt 

atggcccatg ctaaagtgtt tacccttaga 
atccccttac catttaagta ccattctaac 

40 

caggaaccaa atggtggttt tcctgctaac 
atttttctcc taatggttaa aggtagagtt 
45 tggtctcttg tggaggggcg cacaaaggga 

aaggactgac aatgtgccca gtgagaggat 
agaaatacca cgtgcttgcc agatgaaggg 

50 

cttctgctcc tagaaaatca caaaaatggc 
ggcatttgct aatctttcct aacaggacta 
55 ttgcatacag aggataagag atatggcagt 

gaaaattgga tatcctgttg ctgatgccgg 
gaaggcttca gataacacca gggtgtaacc 

60 

ctcttccagc cccatgcaac agctaggtac 
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ccaacatgcc cagtaacctg ggggattggc 
gacttgagaa cagcaggtgg tttgccctaa 

5 

atttttattt gatttggttt taaaatggtg 
tgaggtccac ttctatactc aggagaagaa 
10 aaagatttct ttcaattcac ccagagtgaa 

caataaagtt aggatagcac aaaaagcact 
tgagaatcaa cttatgataa gcttttaata 

15 

cataagtaat catctctaca tttagcagat 
tatccactag actctccagt aggcagagtc 
20 actaggtctg cttagctccc gtattccata 

ggatccaaag tacaccttct gcctcctagt 
tcccttggct tttcttctta aaggaaagcc 

25 

atgatcttcc taagggtcca tgagggatag 
ctttcagaag ctgactcagt ttattacttc 
30 ctctttctct tctaatgaat gaaaagctag 

cagggcgtgg taggcagcta tagtcctgat 
cgggtcattg gctgagatca gctggacctg 

35 

cagtcctgag cctgtgcact gaggcctctt 
ttccaaatgc caggtacagg agaaacccag 
40 aagacacgta aggacaaagg atgtcaagat 

agtcttcctt acttgacctt ctctccagcc 
aaattgcccc atggacataa ttgtggtgca 

45 

gacttgcgtt tcactaccag tccctaaaag 
gaaagagcct ccctgggtac ttggactcca 
50 cagtgttgca tgtcctttcc cagcttcctc 

* 

tctcttgcag tactcttgga gcttttctcc 
tttttttttg catttcaaga cacatcactt 

55 

tttcaaatcc attgtttttc atgcataaat 
ttgtctgtct ggtcatgcac ttcatccatt 
60 gccttattat agatgttata aagatttatc 
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ttgtttatca gatttttatg ttaatttaaa 
ttttatatgt gtagaaattg cttccaaact 
5 tttcttattt ttttaatcac ttggcatttt 

atagtataag ttatagtata agtaatatga 
gaaataaaat tccaagttcc ccaatcaact 

10 

caaaattaac ctgaaaaact agttcaggcc 
attatacaat cctttctttg gaattcaggc 
15 gacctcaaga ttgatggaac agtctcttta 

acgctgaagc ctgctaccag gagtcttcaa 
ttaagccaga cactcccttc tattgattcc 

20 

gccagccaga aaatgtttcg agctacttat 
ttggggcata gcttggtttt atacatttta 
25 aagaggtaca ttggttctgc acagaaaggc 

tggttatagg taggtaagag acaaatggct 
cataggaggc aatcacatct gcattaattc 

30 

ctgcccttgt ccacaaggaa attccttgtg 
tctttatttt aggaatagaa tgggaggaag 
35 ttccttttgg cttagtgact tgggggtacc 

tacttttgag tttacagtac acacacacac 
ccacttgttc tattattact tactgagtta 

40 

tttttaaaaa tacactcaac atttatttac 
catttttttg tcattcagtc tatttcaaca 
45 gcattttaat ttctgacagt acaatcctta 

tgggcctggt gcggtggctc atgcctgtat 
gatcacctga gatcaggagt tcaagaccag 

50 

ataaaaatac aaacattagc cgggcatggt 
aggctgaggc aagagaatca cttgaaccta 
55 caccactgca ctctagcctg ggtggcagaa 

aaaataaaat gcttttggct agtaagactc 
aagttccaca agggttttat ttggaagcac 

60 

aattgacaac cagaaatatg tcagtacagt 
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25 


gtcactgtgc 


tgcttatatc 


cttctgtggc 


tacacattgt 


ctctatgaca 


aagcccaatg 




tctccaggaa 


gcattcaaag 


ccctgtaaat 


ctctcatttt 


gcacatccag 


gatcttatct 




ctctcccaac 


taaatctatc 


acttctccct 


ctatcrataQa 


at aat accaa 


agtgtttcca 


30 


ggtctgctag 


taagccaaac 


tgtttcagaa 


ctctagtctc 


tgcacattga 


tccctctacc 




gagaatgcat 


atggctacaa 


gcacctgctg 


ttctttgaaa 


gcatgatttg 


aaagttaatt 


35 


ttcaatggag 


cctcctcttc 


caagagaagt 


ttaatatctg 


acctttgtgc 


tcccatgatg 




ctccctatgt 


actgtgtgac 


ctttgtgctc 


ccatgatgct 


ccctatgtac 


tgtgttcaag 




crcrcaacrcraca 


ttatcttcca 


caQQt at tQt 


taqtctc tacr 


cacitqttQca 


tqcaccaoat 


40 


acttaataaa 


tattggctga 


ataaaataat 


tctgaatggt 


agctcatttg 


gactagaaac 




ccaggataac 


tctagaaagg 


tgggacaaaa 


atggcagtct 


tgttcacata 


tacagtggct 


45 


gtcacttagg 


agtccctccc 


gtgtgaagca 


gagaattgtt 


gctgtacaaa 


tctgttttca 




aacatgcctt 


catggaaaga 


gagtaacagc 


aacatggaaa 


aatatgtgaa 


tttggaacaa 




aacjcaaaata 


aQataacaaa 


acaataQtaQ 


acaatcrt aat 


tacttgatat 


caatqetaqq 


50 


cagtgtatta 


gcatatttct 


tttgcacaat 


tgggtgcaca 


atagtagcat 


taaatgeege 




agcctgtgat 


gtattcatca 


agactcctca 


actcagtatc 


aatttcctac 


agectagaaa 


55 


aactgtttgt 


ctaagtattc 


attaggaaga 


gccacaattt 


ttggtgggag 


gtgagctaaa 




ctttttactt 


ggttgtgtca 


gtgtttttca 


tatattaatc 


cagataaaga 


tttctattgc 




ctctctccaa 


atcatttttc 


cattattttt 


ccaaagaatg 


gcctgctttc 


ttaatttttc 


60 


cctcctcatt 


ttccatcctt 


catttccagg 


gccaagagta 


aagggttttg 


atcatttact 



-208 



WO 02/062946 PCT/USO 1/48437 





gatattgttt 


gtctgtgtcc 


ccacccaaat 


ctcatcttga 


attgtaactg 


ccacaattcc 


27000 




cacgtttcgt 


gggaggaacc 


cagtgggagg 


tgattgaatt 


atgggggcgg 


gtctttcctg 


27060 


5 


cactgttctc 


ctgatagtga 


atgagtctca 


tgagatgtga 


tggttttaaa 


aatgggagtt 


27120 




tccctgcaca 


atctctcttc 


ttttgtctgc 


ctccttgtga 


gatgtgcctt 


ccactttctg 


27180 


10 


ccatgtttgt 


gaggcctctc 


cagacatgtg 


gaactgtaag 


tccaatacac 


ctttttcttt 


27240 


tgtaaattgc 


ctagtctcag 


gtatgttttt 


acagcagtgt 


gaaaatggac 


taatacattt 


27300 




acctatctaa 


attgcaatct 


gaattggaga 


aacagtggtc 


tttgtgacat 


taaataaatt 


27360 


15 


attttttgtg 


tttctaaatg 


gaaatggagg 


atgaaaaggg 


agaagtacat 


tatcatacag 


27420 




agaaaatcta 


aacaaaatat 


gtaggctaat 


gtatctgaaa 


actacagagt 


tgaagcaaac 


27480 


20 


ttcaaaaaat 


aggaactact 


ggtttagatg 


tttagatgtc 


atggagactc 


cagcccatac 


27540 




tcctgtgtac 


acctgtgcag 


tgtattagct 


ctaccctcta 


ctgtgtgttg 


gettaatget 


27600 




taggcctatt 


tcccacctct 


tagcaaaaat 


ttccacatca 


agtattggcc 


ttacatccat 


27660 


25 


ttatcatcct 


gcagtttgga 


agaatagtgt 


tgtctgaagt 


ttcaaaacaa 


gactattgat 


27720 




aatcattctg 


acgaggccat 


gtaaggttat 


atgtccacat 


tggttcaatc 


attgeaatgg 


27780 


30 


aaagtggggg 


tgaaaagcac 


tgattgactt 


agectagage 


cagagcacag 


tggaggcaag 


27840 




ggcggactca 


gtcctcagag 


ttacatggat 


ccccagtcag 


aattcacaga 


ctgtttgaaa 


27900 




gaggaaaatg 


gaagatggat 


tctgggcagg 


caacaagcgc 


atgetacagg 


tcactccttt 


27960 


35 


gtctgcccta 


tgcccaaaaa 


cactttcttc 


tgatccatac 


ttccaacaat 


gatcacagct 


28020 




acatgatagg 


cattcagctt 


ttcccaagag 


gacatcaagc 


cagaaatctc 


atgcacttaa 


28080 


40 


cactgcattg 


gaatatgaag 


ggagctctgt 


ctaaacaggg 


aactctgtct 


caatagtaac 


28140 




atgtgtttct 


tggccagaca 


attgagecac 


ctccaggtat 


gttctccagg 


aagattgect 


28200 




tggtccattg 


ttctctacaa 


ccacatccag 


catgggaact 


aaaaaattgc 


agcttctagg 


28260 


45 


agcccactta 


atatttggta 


caaatttgag 


ggcctagatg 


ccaatcatct 


atagttgttt 


28320 


■ 


aatccatagg 


gcaattcttg 


ggtcttccac 


caaggaataa 


gaaggaaatt 


aaacaaaatt 


28380 


50 


ttgtcagtct 


aatagaataa 


attacaaaaa 


tttaaaaaat 


taaagcatgg 


tgagtagaaa 


28440 




gaaaaatata 


ccatgaaaca 


tttaaaattt 


gatatattag 


taatcaaaat 


aaaattgaat 


28500 




acagaatact 


atttaaaata 


aatattaaat 


atattaataa 


ttagaagaca 


tatttaaaat 


28560 




CLL>d^clL.clL.ctcl 


del U Lada Lcta. 




CL CX. CX. L, CX.Ci.CXCX ' — . d 


r , t"3t~cfr , r , r , t~a 

L*- C*. w w 




2 8620 




tttgtgatac 


taaaaaaaag 


acaattgaaa 


tattattcaa 


aatattcaga 


agattttttt 


28680 


60 


aaaagggaat 


gctcacaaat 


ttattttaaa 


agatcttgat 


ctaaaccaaa 


ccagagcaat 


28740 




aagaaatggg 


aaaattttca 


gatatttttc 


aatctctgaa 


aaatcatatt 


taaaacacaa 


28800 
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ggaaatataa 
tatgggtgtt 
atcctgtaga 
ccctcacagt 
ccaggtatct 
tatggtttac 
actttggtgg 
atgcagcaac 
atgaaactaa 
atgattgtca 
cccaaaagag 
cgtagatatc 
ggaccccgac 
cctggacccc 
actgagtgga 
aactccccac 
gtaaagggga 
caagtgacta 
atgatactgg 
tgagaaaaat 
tcctcaaaac 
aggggcctaa 

agaaaaagga 
tatcaatgtt 
actgagtgca 
acttttctag 
tattctgcta 
aaataagatg 
tacacagttt 
acttaaactc 



tcccacagtg 
ttaataatat 
acttagtgag 
gacctgaaaa 
gccatgcaag 
agacagtccc 
actcagacaa 
acagctaggc 
taaaactcct 
agattttgac 
cagcaatttg 
ctgaaagtat 
caacaggaca 
attgactcca 
gcctatggca 
cccttaagtc 
ataaaaagag 
agataacagt 
ctctttacat 
atcagataaa 
tcttaacgtt 
ggagatggac 
cattagatca 
tgttcattaa 
gggcacagga 
tataaaagtt 
aaccatgttg 
aactctccaa 
ggtgtaaatt 
ctttactttg 



taacatatat 
ttcactgcgg 
ggagcagggc 
cagccaccac 
gccacctaag 
acccctgaag 
aggcactgcc 
ctttaacaca 
gggcataatt 
cttttttttt 
tagacctatt 
tcatctactg 
ctcaggagga 
tttgctgttt 
aaagaacaca 
tgggctgtga 
tagatttaca 
ggtaagtcat 
ctgtgatctc 
ccccaactga 
ataaaacaca 
acctgactgg 
aaatctaagg 
tggtgtatca 
gaactctctg 
tatttcaaat 
tccaacagag 
ttttacatta 
ctcgttcaat 
ggagttaagc 



cattcccaag 
acttcagagg 
accacgtggc 
caatacagca 
cccctagcta 
agagctctgt 
caacaaggac 
gacaagatac 
cagaaaggag 
ccctccccac 
tggctactgg 
gcttacaata 
cctccaaagg 
gaatcacagg 
caggccatcg 
atagtgactt 
gtgaagaagc 
gttgatagtg 
cttccacaaa 
gggagtttct 
aggaaagtat 
aatgctgcac 
aaatctgaat 
tactaatgta 
ccatcttcac 
tgtttaattt 
ctcactaaaa 
tcaaccatat 
accaggttcc 
aaacactctc 



gtgtaattat 
ttagcatgat 
cacatcactg 
gagcgcacgc 
tctgccatgc 
ccttctgcta 
tagacaccct 
aaggacctaa 
caaagcctct 
tccctcaaaa 
cagcaatccc 
aaggccacta 
gccacccacc 
tctcagcaac 
ggctttggat 
cctttcgaag 
ctgacataca 
tggactgttg 
acccaaacac 
acaaaacacc 
gaaaaactgc 
cttggatggg 
aggacagtat 
agatgttcat 
aactcttctg 
ctcttttgtt 
tggcacatgg 
acacacacaa 
caggaagaaa 
aaaattattc 



actgagaatg 
aaagaaggca 
tagaccctgt 
acctaagctc 
agagatgggt 
tactgatggt 
gttgatccac 
tgatacccaa 
cattgcaaag 
aaaaaaaacc 
aaatacccca 
ctatggagtt 
ccacccttct 
ctctcagttc 
tcataaacgt 
aacacagaat 
ctacctcaat 
atgggatgtg 
cagtctaatc 
tgaccagtac 
cacagtctag 
atcctggaac 
gaacgtcagt 
aattggagaa 
taaatctaaa 
tttcactctg 
tgaattcaca 
tttgtataca 
taaaataaaa 
tgcccccatt 



28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
30360 
30420 
30480 
30540 
30600 
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tttctgtttt gctacatttt gttcagagat 
attttttttt tattggcatt attccaactt 
5 agagaagttg aaaccaaaat caagcccctt 

tagacctcat acttaatcta gacctatgat 
ggagtatcta aagaaaggag ggtagatgga 

10 

gatgacaata aactccagat ttccagaatc 
aattcaggca ccttttcatg ctgctttcat 
15 tatgttccct ctcagagcat tagagcctta 

ctctgtaaaa tatttgaaga gatttattct 
cacagccttc aggaggtcct gagaacatgt 

20 

tatacattct agggaggcat gagacatcaa 
gtccagaaag gtgggacaat ttgaagcagg 
25 aggtaaattt aaacattttc tggttgataa 

aatagaaagg aatgtctggg ttgagataag 
gatgaagcct tcaggtagca ggcttcagag 

30 

cttttttttt tctttttgag atggagtctc 
acaatctggg ctcactgcaa cgtccacatc 
35 tcccaagtag ctgcgactac aggtgcatgc 

aaggtttcac catgttggcc aggctggtct 
cttggcctcc caaagtactg ggattacagg 

40 

aaatgtttct tatcagactt aaagtctgtg 
gcatgtctca accacacttc ccatcatggc 
45 agagccctgg ctgaggagga agtccattca 

gtttacattg tggtactttt gcaaagtttg 
tcactcttga ggcacttctg tcccattggc 

50 

tctctggtct ctgtgttcct ggcatcctca 
acaaaagtcc tttgccacag tagctatggc 
55 ttctaaagca aaccttacag gcacaatttt 

gaacacctct gagagaaagg ctgaaccaaa 
ccttggaggc tcaggacagc tagtcacggt 

60 

ctcctcagcc tttgaactgt caggcctgag 
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gtttccttca 


gattctgcat 


ggggttgggg 


30660 


ttcttagtac 


ccgtttcaca 


tttgcaaaat 


30720 


aaacaattag 


cccagggaaa 


attacgtttc 


30780 


attatttcca 


aacaccaaag 


cagtaactca 


30840 


gaacaagaaa 


ccttgatttt 


cagcctcaag 


30900 


aatgatttgt 


cttaaaatca 


gttgacgctg 


30960 


tggaggatgg 


agaaatccac 


gtgtgcatat 


31020 


gagacattat 


tggtgataaa 


aagtgtcaaa 


31080 


gagcctaata 


tgagtgacca 


tgacccacga 


31140 


gctcaaggct 


gttggggtgc 


agcttgcttt 


31200 


tcttatacat 


ttaaaatata 


tattggtttg 


31260 


aagtgagtga 


tggggcagct 


tccgggctat 


31320 


ttggtcgaat 


ttatttaaag 


acctggaatc 


31380 


aggctatgga 


gaccaaagtt 


ttatcatgca 


31440 


ataattggtt 


gtaaaatatt 


tcttttcttt 


31500 


cctctgtcac 


ccaggctgaa 


atgcagtggc 


31560 


ctgagttaaa 


gcgattctcc 


tgcctcagcc 


31620 


catcacacct 


cgctaatttt 


tagtagagac 


31680 


ccaactcctg 


acctaaagtg 


atccacatgc 


31740 


cgtgagccac 


cactcccagc 


catggctgta 


31800 


ttgatgttaa 


tgctggagag 


gtataaagag 


31860 


ctgaaccagt 


ctttcaggtt 


acattttaaa 


31920 


gatacttggg 


gccttagaat 


tttatttttg 


31980 


gatatttgat 


ggtggtccca 


ggtgtcaaag 


32040 


cttgctcaaa 


acctcctgat 


gtttttttcc 


32100 


tgttgcatca 


agtgaaatct 


gtctctagtg 


32160 


cactatttct 


ttcaatgcac 


tcacttaggc 


32220 


cctct tcrtcc 


tcactcrtaat 


tt caqccaaa 


32280 


gctgctatga 


gggcctaggg 


agccagtggc 


32340 


gccactgtgc 


tccattctag 


aactgagcat 


32400 


tgtggtgctt 


catacgaaaa 


tgctgtagta 


32460 
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cttcagagaa acagaaatgt ttcttttctt 
tgaccctaga attctgcatc cttttaaagc 

5 

ctgctaacaa ttcatctgct tcttaaatgg 
gatgtggttc attgattaca ctatattctg 
10 gttttgcttt ctggcctgga catttttgtc 

aagatgctta gtgttcccaa agcatccatt 
cagtgtccca tgacccccct aaatgacata 

15 

ggacttgatt gttgacttca aaccacccaa 
tttcttggcc tctttgccag ctgaagacct 
20 cttgggccca ggcttactgc aggagatgcc 

tcttgcactc cagcacagat cccatggcca 
gggggtgtgt gagtaagcaa gtgtggggtt 

25 

gtgggctctg tgcagggctt gcagctggac 
actctggcat tcagacaaga ggaacatggt 
30 aagccccaga gggggtgttg ccacattcta 
cagtgaacag cggcatgtta acagctctgt 
tctggggcta gctcggtcct gctgctactt 

35 

gcgaagggga aagggccaca gtgttacagc 
agttattgtc ccacgttcaa gaagaatgag 
40 tggggaagaa ttttattaag tgacaaaaca 

gtcccccacc caaagtcagg tggtctctct 
gagtcccagg cttttatggg ctcagaatgg 

45 

gcagaaaagg ctaaaacaaa gacaccactc 
ttatggaagt gtaggtacat gtaaaacagg 
50 gagaaaagca tgccaaatgg gaagagaggt 

ttgtagcttg gctttcaggc tttaaactct 
cctgcccctg tctgcctacg attgtctacc 

55 

caagtacttg ctttaattta tttttcaaag 
gaactaaggg actaggaata taaataatac 
60 cttgcaaaat aagatacagc aaatgctcag 
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tggagcattt 


atttctgcag 


gatttatata 


32520 


attcccacat 


tttggaaggc 


ttttgctaag 


32580 


atccatttcc 


atatctgttg 


atgtaaagtt 


32640 


ttggtttgtt 


ttttgtttgt 


ttgtttgttt 


32700 


ttcctaacat 


acctcttaca 


ttgattcttc 


32760 


gttcaaacaa 


ggcttgtcca 


cttcatctcc 


32820 


ctactgtgtt 


accagtgcgg 


aaccaacact 


32880 


tcaaggctac 


aacagatgta 


acacaggatt 


32940 


ccagccagtg 


acacccctgc 


ccaggctttg 


33000 


ctatctactc 


agcctggcag 


gccacacctg 


33060 


ctgcaactgt 


gcaataagcc 


tctggtggga 


33120 


gggccagcta 


tttcaatcac 


cggcacagga 


33180 


caggcatgtc 


acaagcaact 


cccgcagtgg 


33240 


ggtacccagg 


cagggatgcc 


catatccctg 


33300 


atagctcttt 


tagttctgcc 


atctgcagcc 


33360 


cagtcccatt 


gccctaatcc 


tgcccatggc 


33420 


cctgttgcat 


ggggtggctg 


ccctccactg 


33480 


tttttttgta 


ctcacattca 


gtgggtcctg 


33540 


gttatgttga 


caattgaagg 


gtgaggaggg 


33600 


actctcagca 


gagagggggt 


tacaaggatg 


33660 


cacccagagg 


tgggcagttc 


ccatgtggct 


33720 


aggagtgcag 


gctgattagt 


ttgtgagtat 


33780 


aaaagtgggc 


atgacaatgt 


aaaaggccaa 


33840 


ggaagggtgg 


tgaagggtgg 


ggaccaatca 


33900 


tctccatcca 


gcctcagatt 


tatctgagac 


33960 


ctttgggtta 


aaggtggtgt 


tcaccggact 


34020 


tcctggcgct 


gtcagatgca 


tcaatctccc 


34080 


ttcctagtct 


ttgaacatat 


gtttttagga 


34140 


ccctttcttc 


tagtcccata 


tttacagtga 


34200 


gtgaaaaacc 


aaaataccac 


tttattcctg 


34260 
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atatgctgaa ggaaaagcat tccttcagtt 
caaccagtca ctggcagagc tgggtcaccc 
5 aaaccctagc aagggaaggg aagaacagat 

caaaacaaat gaagaaacac agatgggctg 
ctctgcaact ttgtgcagca ttcctccttc 

10 

gaaacaggtg tgtttctcta taggaagttc 
atgaagttcc ctcctaactg tggctacctc 
15 gggagagctt tggggctacc tgcagcattt 

aaataatcat gtaaaatgat aatatcactt 
ttttattatt ttctgcattt ttatctatgt 

20 

ttataaatta ttctgagtca tttatcatct 
ggaatacaga aaagaaaaag aagaaaatat 
25 ttattattta atgtaccatt tttatacagc 

aatatttata gttgtttttc ctcatttgac 
cactgtagcc ccctcagaca acagtgagta 

30 

tcataaagtg caaaggaact ataccctcca 
gagatacttc atattaacca agcagtagat 
35 gagaaacctc atgaaaacat agcacatttt 

ttcctttccc ttctcacccc atgttcagac 
aagacctaag tttcaggttc ctgcaatgac 

40 

agtaagtatg gtggcagagt tgcagctggc 
gaagcaaagg aagcccccac gttcactcac 
45 ttctgccgca gcttcaaaag ccaaggtaga 

ggagcttagg gccacttaca gatcctgaaa 
ctgatatcca acccagcaga ccagacgtat 

50 

ttcaaacaca caaaaaaata tcttgagcaa 
cttttcagaa tgaagatagt taaatgtaat 
55 tatatatcca gcacttagaa caaagcctga 

cctttccttc ctcttccatc tttccctgcc 
agagctagac tacccttcca tagccagaca 

60 

ttccatagcc agacaccctc ctcacttcct 
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L_ l_ V_ I — 'wj \— V — - l_ Cl CL 


3 5220 


L LL. L LUdd 


dLddd Lyy dd 


acrr , aat"t'aa a 

ciy W Cl Cl V_ l- CLCl 


3 52 80 

*J *J &i W \J 


ty ULLLL^aa 


LCiy L Ld LdCLCL 


raarrranl" t~ 

CLCL V-< ^— - Ciy 


35340 


cll L>d ty "-y i— y 


l 0 LvUv aaa<v 


i»> u» i— y uaav,. a 


35400 


cty ttyLtyd 


La L LL-LdL-Ly 


Ldy LLy Lv~«ciy 


3 54 6 0 

J J *J \y 


4— 4— /~T<~-c rj /T ^ 4*» 4— 

^"-yyaagaLL 


LdLLLLLdLL 


t~cr~aaf~rnar*a 
LL- Lciayy aL.a 


J J J W 


Ly dddd LadL- 


ddL LyyLLctt 


r , t~cjr , ;5t"t"'hat" 


3 55 80 


gagcc ugc ug 


gLgcggy ccid 


n 4- nfnnh nt~ t~ 
y LLuyy llll 


1 r 4. n 


pt-ra nnat" C a 
v-h \_ ci^-j w c*. 


a cat acjcic t c 

^ h ^ c^. v — \^ 


aaaaacratCd 


3 5700 


ydyLaLLLLy 


LLLayadyLL 


t" at- 1 1 - art t c 

L Cl L L> LnV/ L l-L 


3 5760 


f* ^~ 4~ 4— 4— ^3 -3 /r-i 
CULL LUaaL 


4- 4- 4- 4- 4— nnfint" 


r^t"t"r , f~ , t"r^t~f~ , t~ 

LLLL-LI— i_-l_^L 


3 582 0 

J J U U 


cty LctLLLLaL 


y y LyaLayya 


t*a a craa aa t a 

O CL CL CL CL CL I— CL 


35880 


caatgtaatc 


tcatcaagac 


ttaagagctt 


35940 


tcgcttccct 


agatgataga 


gctagatgat 


36000 


ccagagctag 


atgatagagc 


tagaccaccc 


36060 


ccctccaaca 


cttgcccaca 


ggcacaccta 


36120 
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gaatcctcaa 
gtaagatgtg 
catatgcttt 
acacccttac 
tttgtgtact 
ttctgtataa 
tatataacag 
aagagatgac 
aggtaattat 
tctgttacca 
tgttattgtt 
accacaccca 
aataaaaccc 
accagattat 
aagaggattt 
aggacattgt 
ggaacctccc 
atgttgtaga 
aaaccaaaga 
tatataataa 
gggactctgg 
aggtattggc 
cttacaatgt 
ttgccaggtg 
aagggctgct 
ggccaacaat 
atcatataat 
ttcaaagcaa 
agaccaagag 
gcaagaggag 



gaccactcaa 
gagggaatta 
cttatagctg 
ataagtgtgt 
tttcacaatt 
ttcagcccta 
agcatagaca 
acagaattgt 
tcaggtagaa 
atactgttta 
gtaaaatttt 
ccacccgttc 
cattcaccca 
aaaacattaa 
cattagaaac 
tgatgttttg 
cctcatctcc 
tcctctgcag 
atgctaagac 
atcacccaaa 
gcatgactta 
agggatgagg 
tattgctaga 
ttggctcgag 
tatactctct 
ggtgagaatg 
cacataccca 
gtcacagatc 
gcaggaatca 
tagattgtgt 



gtggcttcct 
tgcactcgat 
ggggcacatg 
gctgggggaa 
gcccagcaga 
tgtggcagtg 
ggagaggcat 
gccaaggggc 
tttagatctt 
gaattgttta 
ctacagacca 
ttgttttgcc 
agtagaaatt 
agattgagta 
agaatgtcat 
tgtggtttcc 
agctcagaga 
tttagggaaa 
ctggactgtc 
cgtagaggtt 
actgagttct 
cctcaaatga 
attcagttcc 
gtctttgtag 
taccatatca 
agacaccagt 
aaatcacata 
ctgcctatat 
tctagccacc 
gggggtggtc 



ggtaccactc 
tcttcatatc 
catttacaca 
agctatgcaa 
tggcagcaaa 
taaagctaga 
tctgttcagt 
aggacagaag 
tttcgttgtt 
aaaatggatg 
tgcttctcct 
atagagatga 
aagaaaaaag 
aaagagtaag 
ttggaaactg 
tccctcctgt 
gagaggctgg 
atgagaaact 
tactctatag 
taacacagca 
ttgcttcaag 
ggtttgacta 
ttgcagttgt 
gatattggct 
agactgcaac 
aagacagagt 
catcctgtca 
ttacagggaa 
ctagagtggc 
tgcacttcca 



ttgctagacc 
ttcgatagtg 
aacacatgca 
ctctgaggct 
gtgtcctgag 
ggctaaattg 
ttcatctgtg 
gaggagtcag 
gttgctcctg 
ctaaaaagca 
cattgtctgc 
caatcgctcc 
agccaaacat 
aaattgagta 
tactggagaa 
cactatagga 
aatgggaccg 
tgagtccaac 
ccagtgtttt 
cacctgttac 
gtctcaccaa 
gagaaggatc 
aggataaatg 
agagaccact 
atgacaaatt 
ttacagtctt 
cctttgcagg 
ataactacaa 
tctcatggtg 
cctgtatgct 



aaagaaaatg 
ttaaatggtc 
cacacacaaa 
ccaaccactt 
gaactacttt 
tccctctaga 
gcccgaagac 
cccccggagc 
agagcagtat 
tatggcagtt 
acctgaagag 
taatttacac 
ggttttctca 
agactgaagg 
tttcgttaga 
gaaggaagga 
catggggatc 
acaccaggag 
tatctattgc 
ctcacaggtc 
gttccattta 
cacttccaag 
gtctcatttc 
ctcaattcct 
ggtttctcaa 
ctgtgacatg 
attctggtta 
aagggcatga 
gcacaattaa 
gtgtgaaaga 



36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
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gattctacat ggatggtctt agagcttgct ctgaaaagaa atcaggaggg agcaaggaga 37980 

ccttgaagag aatgaggctg gagattttct gatggacatc cagggacagg acataggata 38040 

atcgtcagct gccagctaag tccatgagga tactgagagg aagcccagcc aaggaatccc 38100 

agtgatggaa ccacagatgc attcaacatc caccagcctg catagcactg acatcgagtt 38160 

gggaaacggc caatcgagag agactttttc tgttcctctc tcctcattcc tgcccctgac 38220 

tctgtgagga tcaggaatcc ctcttagcaa tcctcagggt gaagagttag agaagaaata 3 8280 

gcactgaccc ccactttaca tgtcagctgc cctgcctaaa gcaggccaaa ctgaggaaaa 38340 

gaagaaattt taatatgaat gacagttgga ttttgatttt tgcataaatc tatattttaa 38400 

ttactaaagt gactgaggaa ctcttctatt agctgacagt agtaacatct tagatttcaa 38460 

■ 

acaagcagca ggggagagct agccctactg aggatcttag aaagaacagt tgtggggcca 3852 0 

ggcacggtgg ctcatgcctg taatcccagc actttgggag gccgtggcgg acgaatcgcc 38580 

tgagttcagg agttcgtgat cagcctggcc aacatggtga aaccctgtct ttactaaaaa 38640 

taaaaaaatt agctgggcat ggtagtaggc gcgtgtaatc ccagctactc agaaggctga 3 8700 

ggcaggagaa tcaccaggct ggtctcaaac tcctga 3 873 6 

<210> 79 

<211> 104644 

<212> DNA 

<213> homo sapiens 



<220> 

<221> genomic_DNA 

<222> (1) . . (104644) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> MS4A7_initial_coding_region 

<222> (17493) . . (17639) 



<220> 

<221> MS4A12_initial_coding__region 
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<222> (91488) . . (91760) 
<220> 

<221> MS4A5_initial_coding__region 

<222> (64028) . . (64180) 

<220> 

<221> MS4A7_coding_region 

<222> (19439) . . (19573) 

<220> 

<221> MS4A5_coding__region 

<222> (65149) . - (65277) 

<220> 

<2 21> MS4A5_coding_region 

<222> (66764) . . (66820) 

<220> 

<221> MS4A5_coding_region 

<222> (68118) . . (68270) 

<220> 

<2 21> MS4A5_coding_region 

<222> (82002) . . (82109) 

<220> 

<221> MS4A12__coding_region 

<222> (95210) . . (95347) 
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<220> 

<2 21> MS4A12_coding_region 
<222> (96148) . . (96204) 



<220> 

<221> MS4A12_coding_region 
<222> (97865) . . (97981) 



<220> 

<221> MS4A12__coding_region 
<222> (100890) . . (101000) 



<220> 

<221> MS4A12_coding_region 
<222> (101182) . . (101283) 



<220> 

<221> MS4A7_coding_region 
<222> (21068) . . (21124) 



<220> 

<221> MS4A7_coding_region 

<222> (23741) . . (23947) 

<220> 

<2 21> MS4A7_coding_region 

<222> (27037) . . (27138) 

<220> 

<2 21> MS4A7_coding__region 
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<222> (28139) . . (28210) 

5 <400> 79 

nnncttgagt caggatgtgg cttttcctta 

agagaaaggn tgntacggag agaagcaaag 

10 ccaaggtttg tgggaaaagt ggntacagtt 

gagagaagca cagggtctaa gcccatgtgc 

tngaaggcat atgcttgtca tattgaacaa 

15 

tgtcaattga aagcttattt tatgttcgcg 
tncagtttgc ttggcctttt aaagtgctgg 
20 tgaaactgct tttttgtgct gagtcagttc 

agtttattga tctgggtggt gccagctaat 
tgctcttagg agcagtttag gaagggtcag 

25 

aaaccataat ttctaatctt gtgtctaatg 
gcaagaaaga ggtctgcttt gggaaagggc 
30 taaactaagt ttctcccaaa gttagttcag 

ggaggttaga agcaaaatgg agtcagttaa 
ttttgcaaag gcggtttcat atcggattca 

35 

taaagaggaa ggcattttgt atcaataaaa 
tgagacggag tctcgctgtg tcgcccaagc 
40 gcaagctccg cctcccgggt tcaggccatt 

ctaaaggcac ccgccaccgt gcccggctaa 
gtttcaccca tgttagccag gatggaattt 

45 

attttaacca acgttaataa tctaaatacc 
taacaaaata ttatatatac cccataaata 
50 atctcaaatc acatgtgtat tgggtcagat 

aagatcttac acttgtgtta aagaacaaat 
aaattgaaaa gctgagttta tgtgtttgtg 

55 

ctctttccct gtggggaaat tcattaagta 
ttcaaactga atagaaagag gtttcataat 
60 agtcttcagt ctttaggctg ggaaagtgag 
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aatcacttnt 


atgggtgtgt 


ggcagggaag 


60 


gctttatagc 


tgcttcttcc 


ctttctcctt 


120 


tgataatatt 


tctgtttgtg 


aatgggtgga 


180 


ttttgcttca 


tgccttgtca 


nttaggaaaa 


240 


atattgattg 


tcaattcaag 


atcaattgct 


300 


caggtcttaa 


tagataggtt 


ttccaggtga 


360 


gattataaac 


atgagccacc 


tcacccaccc 


420 


ctaggtgggg 


gccacaagat 


cagatgagcc 


480 


ccatcggggt 


ctgcaaaata 


tctcaagcac 


540 


aatcttgtag 


cctccagctg 


catgactcct 


600 


ttagtcctac 


aaaagtaatc 


tagtccccag 


660 


tgttactgtc 


tttgtttaaa 


ctgtaaacta 


720 


cctactccca 


ggaatgaaca 


aggacagctt 


780 


gttaaatgtc 


tttcactgtc 


tcagtcataa 


840 


gtgatgaaaa 


gattttatct 


gaacaccact 


900 


taatttttca 


aaaaaaaatt 


tttttttttt 


960 


tggagtgtag 


tggcgcggtc 


tcggcttact 


1020 


ctcctgcctc 


agcctccaga 


gtagctggga 


1080 


ttttttgtgt 


gtgtttttag 


tagagacggg 


1140 


ttttaaattt 


taaccaacat 


tatttttaaa 


1200 


ctgacttgat 


cattacacat 


tctatgcatg 


1260 


tgtacaaaga 


tcatgtatca 


attaaaaaga 


1320 


gttggtaaaa 


tctgaatccc 


attttctgac 


1380 


taaagatcac 


tagagctata 


aatagaatat 


1440 


tatgtgccca 


gtaaggaaag 


agccccgtgg 


1500 


ttcaccagct 


gggggaaatt 


caggggtttt 


1560 


agttatgcaa 


attgaagact 


gaagactagc 


1620 


ttgtttaaaa 


caagtcctgt 


tgtttataat 


1680 
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cacttaccag tttcggtcaa ctgaatcagc 
tgtttcctaa aaatattttc cttttatact 
5 aaaacttgag caatacattt tgttttctaa 

tgtgtgtatg tgtaaaaatg aaaaataaaa 
aaaaagatgg caaagttgtt tcattcccat 

10 

ttagaaattt gttcaagtca taaaggtatc 
taggccaact aagaattcct aagcctagct 
15 caggtcttgt aactcagctc atacacgacc 

acaaattagg ctaacagcag gaagtaaaga 
cagtgggagg gacaatgatc gggatataaa 

20 

cctttgagtc ccctcccttg taagggagct 
attgctcttc tggtccgtgt ttgttctggc 
25 tgcagtttgc cgccgtcgca gaaccaccgc 

gtcccctgcg ctcctgatcc agcgaggcgc 
tgcacggcta agtgcccggg ttcgtcctaa 

30 

agttctcttc cgtcatccac ggcttccaat 
cttccattcc ttggaatccg tgaggccaag 
35 gccatctttg gagcagccca ccaccatctt 

aatctttgag taagttcaca attgggttat 
gataaatcca cacactccaa tagaaattat 

40 

gttaggcatg gtgtccccat ccaaatctca 
agagacctag tgggaggtga ttagattatg 
45 atcatgagtg aattctcata agatctgatg 

tcacacactg tctcctgcca ccatgtgaag 
atgattgtgt tgtaaatttc ctgaggcttc 

50 

acctctttcc tttataaatt acacagtctc 
agtaatataa tagtcaacat aatttgtctt 
55 aaaagaatga ttactttatt gagaagtggg 

agccttcttt gtatataaaa atggcagtgt 
gaattttgct ggaatttgtg ggaatgggct 

60 

cccttctact gctctcctct caccaccgtc 
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tgcagtgaga 


aaaaacattc 


agttttgatt 


1740 


caaaattagt 


tgctaaccaa 


aattaaagaa 


1800 


atgccactaa 


aattttatat 


gatataggag 


1860 


tgtttccaaa 


tgcgttttag 


taatatcaac 


1920 


tctctgctac 


catctggttt 


aagaaaatac 


1980 


attttgagag 


acaggactag 


ctggatttcc 


2040 


ggagaaggtg 


acggtaccca 


cctttaaaca 


2100 


aatcaggtag 


taaagagggc 


tcactaaaat 


2160 


aatagtcaaa 


tcatatattg 


cctgagagca 


2220 


cccaggcatt 


cgagcaggga 


gcggcaaccc 


2280 


ctgttttcat 


tctattaaat 


cttgcaactg 


2340 


tggagcagag 


ctatcgcttg 


cgtccaccac 


2400 


tgacttctac 


ccctccggat 


ctggcagggt 


2460 


ccattgcccc 


taaaggctcg 


ctattgttcc 


2520 


tcaggctgaa 


cactagtcac 


caggttccac 


2580 


agagctataa 


cactcacagc 


atggcccaag 


2640 


aaccccaggt 


cagagaacaa 


aaggcttgct 


2700 


gggagctcta 


agaacaaata 


ttttcagata 


2760 


ctgaaatatg 


caaggaatcc 


tgttcaccaa 


2820 


cttgcaaatt 


caatgtcgat 


agttgacatg 


2880 


tcttgaattg 


taatccccaa 


gtgtttaggg 


2940 


gggacagttt 


cccctatgcc 


cattcttgtg 


3000 


gttttctaag 


cagtagtttt 


tcctgcactc 


3060 


aagtcccttg 


cttccccttt 


gccttccgcc 


3120 


cccagccacg 


tggaactatg 


ggtcaattaa 


3180 


aggtatttct 


ttagagcagt 


gtgaaaacag 


3240 


tttattatct 


gatctcccca 


tttttcttat 


3300 


apt~f* , i~crr , ac , a 

d. l_- < — l_ v-J V_ d C*. 


a t~ t~ aacracrafc 


aatatttccc 


3 3 60 

^-f \y 


gacatagctc 


tcgctaatga 


gacctaagaa 


3420 


aatttcttat 


ttaggcatag 


cccttcctct 


3480 


cttcttttcc 


ttcttgttct 


tcatagtctt 


3540 



-219- 



WO 02/062946 



PCT/US01/48437 



tcaaaagaaa aggcaaatag aaagctgaca 
cccatggaga tgaaagctca ccttatggag 

5 

acactgatga tactgaaatg gccattgcaa 
agagatctga cctaaccaac ttcaccttgc 
10 ctgggcctag gccaaactaa ctttgggagg 

tatctccttt ccaaaacaag accccttcct 
aaccaattag ccacacaatt agaaattatg 

15 

agttctcacc ctccccagtc gctcctgggg 
gtgcttcgga tattttgcag accctgcaat 
20 taaactggct catctgggtc ctgtggcccc 

agctttgatt ccctgtgatt ttatcttcga 
ccccttactc accaaattat ccttaaaaaa 

25 

ttgagtaata ataaaagtct agagggtctc 
ctctctatgg caattcccct gtcttgataa 
30 aacccactga gcggttacaa tatcgtaggt 

gacttcttga aaaagcctgc atttctaact 
tcaaattgat atactaatct aggtattttt 

35 

taaacttcaa gttattctaa tattaaaagt 
ctcagtcacc tttccccacc atttagaatg 
40 accttaaaaa tctcctggtt ttattgattt 

acagaaagcc atagtgactc acccagattt 
gaccaacatt gtttcttata tgctgtgttg 

45 

ccatttattg tcaatactca gcagaactat 
aatggaatat gtatcagcat tgcaaacatg 
50 tttcatcagc ttggattctc aaggtctcat 

gtcttatggt gatgctggac atcagtgtaa 
attatcgggg aagccaaagg tttacctagg 

55 

ctcagtttgt ttttgtgaag ccttcgtgat 
ccggattcgg tttctagctt gtacaattaa 
60 ttttgtcagc agagtattat ttgtattggg 



atggattatc caacttgtta ccatacagct 3 600 

aactgagcac aaaaacctga gaatctgaaa 3 660 

aattctggct gagagcatta ttacagtgaa 3720 

ttctaacctc caagccatcc ttgttcattc 3780 

aacttaattt acagtttgaa acaaagacaa 3840 

gcctggggtc tagactgcct ttgtaggagt 3 900 

gtttaggagt catatagctg gaggctgcga 3 960 

ataacatcac tattgtaaaa cctaagatca 4020 

tgatggatca gctggtacca cccacattaa 4080 

cactcagaaa ctgactaagt gcaagagggc 414 0 

cccaagcaat cagcactccc aactcactgg 4200 

cctaatccct gaatctttgg ggagactgat 4260 

ccatagagtc agctgtgcat gaattaaact 4320 

actggctctg tctaggaact gggcagagag 43 8 0 

cttgttaatt agctctagac agtttcccta 4440 

ggtgaagctt tggcattaag gaatgcaatc 4500 

agaaataatg taattgtaca aatgagcaaa 4560 

gaggttaaaa ttgttttcac taataattac 4620 

caccatttgt ttaatgctat agctgggatt 4680 

ctagtattta atgggtgaag aaattaatct 4740 

atgattcaga cttagagttt ataagtccaa 4800 

cctacagggg tccatagtac ctgttctcta 4860 

taatacctca gggaagcaag attagtaaat 4920 

agctatttca tgaaggccag ggtgagagca 4 980 

aaatgcttta atttgtaact gcatcaacat 5040 

cacagacata gacaattacc ttcaaatcca 5100 

aaatattacc tttgccagga ctaggaacgt 5160 

tggcacaata tgagtttaaa gtatggagaa 5220 

cagctgtgta aacaatgact cactcaagaa 5280 

ttttgtttgt ggggattgtt ttgtttgcat 5340 
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atatatttgt gtattttatg tggttttgag 
tctacccagg tctctagtag tcactttagc 
5 attggcttcc tagtttttca ctacacatac 

tgccatgtgt acatacttcc agaactgcca 
gagaatccac ataatgtcac aggaatgact 

10 

aagaagctct tctatgctgt ggatagtgag 
ctataattta ataagatttt tttttctgag 
15 tgcatttttg tctaccttgg gagggaaagt 

gcagctttgt ttgatttcta gatatttcta 
ccagaaactt ctatgggaaa aaagggaaaa 

20 

cactgaggaa gtaaagcgtg atctggagaa 
aggccagttt taagaaaccc ttttatacta 
25 ttcaattttt ttaaaaaaag aaggagaact 

gcaaattatt ggaacaaatt gttttgttaa 
acaaatggga ttatcttatg gacaaatgtg 

30 

aatgatttct gaggatcagg aaagtggaat 
gcaatagtgc agatcagctg ctgttgggag 
35 gatcagtggt agtgcaaagg agggcatagg 

aacctataaa atggccagga tgtcatattg 
agagaggatt agcaaggcac ctgttgtggt 

40 

agtttaaaag acagttatct gacctctctc 
tgcctgtaaa tatgtctatg tttgaggaat 
45 aatttattgg agcttccaag agtctttaca 

gccgggcacg gtggctcatg cctgtaatcc 
acgaggtcaa gagatcgaga ccattctggc 

50 

atacaaaaaa ttagctgagc gtagtggcat 
gcaggagaat cacttgaact cgggaggcag 
55 taccactgca ctccagcctg gcgacagagt 

aaaaaaaaag aacattgcag gggccaaggg 
gctgaaaagt caacttgtaa aagtcagatt 

60 

acatagacat gagggccttc agaatgaaga 
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gggcatgggg 


gttctgctca 


tctctgtgtg 


5400 


ccagatagat 


tctttagcca 


ttaactaaat 


5460 


aataacgtac 


ggtaatcaga 


tgtgtgactt 


5520 


gttcaaaata 


aacagaccct 


ccttgacatg 


5580 


cacatacctt 


ccaggaagat 


atgtgagtaa 


5640 


cagagctata 


aagaattaag 


gctatagaac 


5700 


ttttccctgg 


tggaataaac 


aaagggaaaa 


5760 


aacatgattg 


tgaagtctct 


tagatttaat 


5820 


agagtttgtt 


aacatttgta 


ataataggca 


5880 


cgtgaacaaa 


aaaaggaact 


atggcatttt 


5940 


ataaaaaaca 


gacaaacaca 


tattggtccc 


6000 


agcccttttt 


aacagaaagt 


gttaatagta 


6060 


cataaaattc 


acttagctct 


cctaattacg 


6120 


attcaatcag 


tcttaggtct 


tagagacgct 


6180 


acagaaatgg 


gatgggaact 


tgaagaatat 


6240 


taagtccttc 


tgatttcgag 


aaattactgg 


6300 


atagaaatca 


ttgcaatttt 


aataggctaa 


6360 


atgaagcaaa 


ctttatagat 


ataaaatgca 


6420 


gagataaaga 


cacctttggg 


agactgtact 


6480 


tacacacaat 


cccagtgtct 


tctaaacccc 


6540 


caagtttatt 


acagactaaa 


gaatcacagt 


6600 


tgaatgacta 


tcactaagag 


atgggattcg 


6660 


agaatggcaa 


gaatgttgca 


ggggcccagg 


6720 


agcactttgg 


gaggctgagg 


cggacgtatc 


6780 


caacatggtg 


aaaccccgtc 


tctaccaaaa 


6840 


gcacctgtag 


tcccagttca 


ggaggctgga 


6900 


aggttgcagt 


gagtcgagga 


gccaagatca 


6960 


qaqac tcccrt 


ctcaaaaaaa 


aaaaaaaaaa 


7020 


tatagcttcc 


cttttgccct 


ctgaagtttc 


7080 


aattgagaaa 


aggcatacaa 


attattcaac 


7140 


cccaaaaaat 


ataggggaaa 


gtatccattt 


7200 
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gtatgcttag gttcaacgaa gtaaggacag ccctgtagaa atatgactgg atataatcta 7260 

tgctaatagg cttagttaga accgagcaag acccgtctgt ctaggttttt cttggcctct 732 0 

ctgtgctgca ttccttcatt ttggttgtag gacaggacca tctctggaat gggcatctta 7380 

tgacctacag tcaaacaaga tagctcagat gatttcttta tgaccagttt tacacagata 7440 

gagtggaagg agagttagag taacactttt aggttttaag gctggctgtg gcaagagggg 7500 

tgtctggttt ctatgaccca ccttggggag ggattcttgt ttgtacagct agcctcaaag 7560 

gaagatagga ctgagagaca ggaaaggaag tcagagaaaa acttttgctt ctgaggtatt 7620 

gttttggggt attgttttct gagcccctaa atacataata ggtatctttg tctttgccct 7680 

ttttattgtt cttttcctcc ttttttctct tatgtgccac tgtcaccatt ctcaggtggg 7740 

ctttcgagaa tgaatgtgca aaattcatga ttagggtcaa ccacatctcc atagctatgg 7800 

gccttacaag cattaactgt ctatcatttg catcttcaga ctgatattca gaacagtttc 7860 

aggattctca cactagccta gagaacaatt ttttaaataa ttccatcaac atattgatat 7920 

gggagagggg cagggaagtg ctaagtagag aagggcaggg tccctggcga ggttccaccc 7980 

tcggcctttg cccacaaacc caggtgagga caggcactcc tgttttcatg ccaaatgttg 8040 

catttccaag ccagccatgc ccctcctcct gtgcctataa aaaccctgat accctagtgg 8100 

gtatacacac acaagtggct ggatgtcaag aggaacacac tggcagaaga acacaccaac 8160 

agttgctggc agaccatcga tggcagaaca aagcagatac caaaggaaat tcagctgagg 8220 

gtggttggag gagagccagg ctgctgagca gccctactcc agagaagacc accttcccac 8280 

tccatccccg ttctggctcc ctgtccatct gttgataact actatcacca ctcaataaaa 8340 

ccttgcactc actctccaag cccacatatg atctgatttt tctgggacac ttgggcaaga 8400 

accctgggat acagaaagcc ctctgtcctt gcgataaggt agagggtcta actgagctca 8460 

ttaaagcaag gtgcctgcgg acggctaagc tgaaagagca cattgtaaca tacacccatt 8520 

tgggcttcgg gagctgtaaa cactcaaccc tagatgctgc catggagtcg gagcccacgc 8580 

tccccatgac ctgcccgtct gcatgcttcc cttaggggtt tgagctgcgg ggcaccaaag 8640 

aagtgagcca cacccccatc acgcaccctg caagggggat aagggaaaac tcccctcgtt 8700 

tcaatatgtt aaagcttatt caccccatga gagccatttg gttggtaggc tattaattat 8760 

tgcctcaatt tcagagcctg ttattggtct attcagggat tcaacttctt cctggtttag 8820 

tcttgggagg gtgcatgtgt ccaggaattt atccatttct tctagatttt ctagtttatt 8880 

tgtgtagagg tgtttatagt attttctgat ggtagtttgt atttctgtgg gattggtggt 8940 

gatattccct ttatcatttt ttattgcttc tatttgattc ttctctcttt tcttctttat 9000 
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tagtcttgct 


agcagtctat 


caattttgtt 




attaattttt 


tgaagagttt 


tttgtgtctc 


5 


tatttcttgc 


cttctgctag 


cttttgaatg 




ttgtgatgtt 


agggtgtcaa 


ttttagatct 


10 


tataaatttc 


cctctacaca 


ctgctttaaa 




tttgttctcg 


ttggtttcaa 


agaacatctt 




gtagtcattc 


aggagcaggt 


tgttcagttt 


15 


cttaatcctg 


agtaaaatgc 


tattcttaac 




tgttactagg 


cttattacta 


ggactatgta 


on 


tctaggagaa 


tgaaaacacc 


atagaggaag 




atccaagcac 


ctagaacaca 


gcctggcaca 




gaaaagtgag 


gaatattttt 


catgtggaat 


25 


ttcagtaagg 


agattttctg 


tgaagtacta 




agcagaggca 


gagaccaaga 


gaggcagaaa 


30 


agagactccc 


ccatgcttag 


tcatcccatc 




tacaaagcta 


gagatgctca 


gcctcatccc 




ttcatcatca 


ctgaatggtg 


agtcccagca 


35 


catttgtttg 


tatttcactt 


ggaaggtagg 




caagtccaga 


tgaccagtaa 


agcaagttcc 


40 


agtcccagcc 


aggcacagtg 


gctcacgcct 




agcagatcac 


gaggtcagga 


gtcgagacca 




actaaaaata 


taaaaaatta 


aacaggcgtt 


45 


ggaggctgag 


gcaggaggat 


ggcgtgaccc 




cccgccactg 


caccccagcc 


tgagtaacag 


50 


aaaacgatca 


cagagcacag 


tccctgaatt 




ttactttcta 


gggtgtgtga 


cagggtaagt 




gccaaaagac 


agaaaccttc 


tgttatttta 


55 


CCtQtqtat t 


aaaqaactqc 


aaacracataa 




actgcatgaa 


tcagccacag 


cccagtcact 


60 


gtttccccac 


ctgtaaaatg 


ctgaagttgg 


ggcgtgatac 


gagattctaa 


gaaacaggga 
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gatcttttca 


gaaaaccagc 


tcctggattc 


9060 


tatctccttc 


agttctgctc 


tgatcttagt 


9120 


tgtttgctct 


tgcttctcta 


gttcttttaa 


9180 


ttcctgcttt 


ctcttgtggg 


tatttagtgc 


9240 


tgtgtcccag 


agattctggt 


atgttgtgtc 


9300 


tatttctgcc 


ttcatttctt 


tatgtacgca 


9360 


ccatgtagtt 


gagtggtttt 


gagtcagttt 


9420 


atcagttctg 


ttccattggt 


ctattcacct 


9480 


gtatgcttcc 


tcactgttct 


atttgaacat 


9540 


gtggtttttt 


ctgttctgtc 


cattgatgca 


9600 


gaagaagcac 


tcaaactgtg 


gtatttcaat 


9660 


tttttaaagg 


aatgttgtgc 


cctacacctt 


9720 


cgcgaggcct 


gctgtgtgca 


tggcgagatc 


9780 


cctggcagag 


ctgctgagga 


tggtgtagaa 


9840 


atgcccattg 


tcatgtgtag 


cctccagtta 


9900 


catttgagat 


aattgcattc 


ttttgcccta 


9960 


attctgatat 


caagagcacc 


aggaaataac 


10020 


gtctcacacc 


aggtttgggg 


atttactgtt 


10080 


tggaagtaat 


aatatagcag 


ttaagagcac 


10140 


gtaatcccag 


cactttggga 


ggctgaggcg 


10200 


tcctggctaa 


cacagtgaaa 


ccccgtctct 


10260 


gtggcgggcg 


cctgtagtcc 


cggctacttg 


10320 


cgggaggcgg 


agcttgcagt 


gagcagagat 


10380 


agcaagactc 


cgtcaaaaaa 


aaaaaaaaaa 


10440 


aagattacct 


ggattcaaat 


ctagacttca 


10500 


tacttaacct 


ctctgtaatt 


taggttatca 


10560 


gcagagaata 


cttgctagaa 


agagtcatta 


10620 


agaaaacact 


gaaataatac 


agacatagaa 


10680 


actaactccc 


taagcaatcc 


ctggccttta 


10740 


ggggaaaaat 


cgtttcactt 


tcattccggt 


10800 


gaaaattccg 


tgagtggagc 


caataaccaa 


10860 
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gtcaaacatc 


aggggggtct 


atgaacactg 


ctccaaggaa 


tctctctctg 


gataaccaga 


10920 


5 


ccttgtgccc 


tagccccagc 


ccgtcatctc 


cactgetaga 


gaaaaaaaat 


acagttagtc 


10980 


tgtccaaggc 


tgttagagtc 


ctccaacata 


taagaegcat 


ttccatttta 


ctagcttcat 


11040 




ggtctgcttg 


gggtggggat 


ggeggggaga 


tegtatcett 


gaagtaggca 


ctaggactgt 


11100 


10 


gagtcacttg 


cctgtctaga 


gcaccaggac 


tgtttgcaaa 


gggctgagga 


gatttccatc 


11160 




aatggaaaag 


agcatgaatt 


tagagtaaac 


atagattaca 


cttagagatc 


agagatcaat 


11220 


15 


ctagagataa 


aatacacttt 


tctctaaagg 


gggtgggaga 


agatgttgga 


caagaaatac 


11280 


cagattttct 


taatggttca 


tttatgtcag 


gctctatggt 


ggggactttt 


atgttatttg 


11340 








^— ' u- y Cl Cl Cl Cl Cl ^» 


ttttttcaaa 

lw ^ C- C- \^ ^ Cl U Ul 


ttttatoaaa 


aaataaacto 


11400 


20 


gatatttaac 


tgattttctc 


acgaattcct 


tctgaggctc 


agaagtgtga 


aagaagecca 


11460 




aatgaccaag 


gttctttaac 


aatttggtca 


aaacaggtca 


tagacccaga 


ggaaggtctg 


11520 


25 


gatcctggaa 


gttttcttaa 


gcaaagacag 


tgcttgttgt 


ggatatgaag 


atgttgacta 


11580 




agctgggcac 


agtggctcac 


acctgtaatc 


ccagctttgg 


gaggecaagg 


caggtggatc 


11640 




3 f ft na cr^ 1~ r~" 

Ct V_ y Cl V— j CX L> V— - 


^ rrrj a o t" 1" C Tl A 

Ci y y Cl l_» C V— < lid 
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accaacataa 


ataaatcctcr 


tctctaotna 


11700 


30 


aaatacaaaa 


aatagacagg 


catggtggtg 


ggcacccata 


gtcccagcta 


ctggggaggc 


11760 




tgaggcagga 


gaatcacttg 


aacctgggag 


gtggaggtta 


cagtgagctg 


agactgcacc 


11820 


35 


attgcactcc 


agectgagea 


acaagagcaa 


aaaactccat 


ctcaaaaaaa 


aaaaaagatg 


11880 




ttgaccaaat 


ggcagactct 


tgccatgaag 


caagcccaga 


cctcagtgcc 


aagagatgtg 


11940 




CL CL C I*- Cl y Cl l_ V_ 


r i t"apc't"r*t"1~c 

N — - y C N— - ^ — ' — V — - 


r*a raaactaa 

k> OL \y> (A 


acatcrtcract 




tccttcaaca 


12 000 


40 


atatcacctc 


atttggagaa 


gaactggact 


aaaataccct 


ccaagtagtt 


tatgagttat 


12060 




ttaaatctcc 


caggcttagg 


tttgctcctc 


tcttagggga 


tagccttgcc 


acccttcagg 


12120 


45 


aaatggagac 


tgtatatgga 


gcatgttgtg 


ccgccagctg 


ctgcatgtcc 


cagagccatc 


12180 




ttaaacccta 


cccttctgaa 


caatggctag 


tctacaagaa 


gecttaaaag 


ggatgtgaaa 


12240 




at" acre 3 araa 


C V— • »w ^* *w Cl 


acaacaaata 


cttccccaaa 


tataaactta 


tatacfaocct 


12300 


50 


gatgactctg 


cctctgaggc 


caatccactg 


gcatctttcc 


caagatgtgc 


ctggacttcc 


12360 




ctgagcatga 


ggecaaagat 


ggagaatgga 


aacacccaaa 


agaagtaaca 


aggagacacg 


12420 


55 


gaattttcca 


caggttgett 


tcagcttacc 


aagagcctgg 


aaagtgagag 


aatagattct 


12480 




ccaaaattca 


aaactaacat 


tactattatc 


aatatttatt 


gtctactttt 


tgtaaactct 


12540 




ttgcatcttt 


tttctttaat 


tctcccaaca 


cccacagagt 


ggaattatta 


ttatccatcc 


12600 


60 


accatttatt 


taatagacaa 


gaaaactcag 


gctaagcaaa 


actagatgac 


ttaaataatt 


12660 
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ctatgtagga gtagggaaaa tgaattttaa 
aaggtctttt tctcatgtgt cacttctatc 
5 gaaagaggaa gcagaggcac tttttatata 

atcctgtgta atcataaata ctgctctaag 
gcagagcacc aagattgttc tggcaaggaa 

10 

cccagaggtg agctgatccc cagccacagc 
caccttggtt tagatcatga cttttgggga 
15 gaattggatt atcatcctgt ctctgattct 

agtccccttt ttgggtctca gtttactcac 
tcttaggttt ccctcattga ggtgattttc 

20 

ctcagtctta cagagtctac tctcaacaaa 
gaaccccatg gtcctactct aattctgtaa 
25 ctccatgtca tgatttatct tcaaagcacg 

ccctccagtc tatctcctca aggcgaatgt 
tgtttggaga tgatagcatc agtctggaga 

30 

ataggattcc ttctaaaagt agcaacgtgt 
taataattcc acttccatac atcactgtga 
35 tgcaaagatg ttcattattg cattgttcat 

aaatagtcat gaacatgtga ttgattaaat 
accatttggc tttttggttg tagtaaaata 

40 

gtgttttttt gttgtttttt gtttgtttgt 
gctcagtcac ccaggctaga gtgcgtgatc 
45 tcaagcgatt ctcccatctc agcctcctga 

gcctggctaa tttttgtatt tttttagtag 
gtcttgaatt cctgacctca ggtgatctgc 

50 

caggcatgag ccgctgcacc cagccttaac 
agcacattca caccattgtg caaccatcac 
55 cccaaactgg aactccgtgc ccattaaaca 

cctggcaacc accattctac tttcagtgaa 
gaatcataca atatttgtcc ttttgtgact 

60 

ggttcatcca tgttgtgcca tgtatcagaa 
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ctgggaccgt 


acaaatcact 


agtctgcaca 


12720 


ttgtgtcatt 


ttgccagacc 


tcagaattta 


12780 


tcaactgtct 


gtaactacag 


caaaattcac 


12840 


aaagggacag 


gaagtctcag 


aggctggaga 


12900 


cagccagtgg 


gaggttccag 


ctgagcgctc 


12960 


acacaggacc 


aggctgcgag 


aacaggtaga 


13020 


agattgtgtg 


cttttgaaaa 


gacagccagg 


13080 


gaccacctgt 


gtgagttgga 


gcgtcagctc 


13140 


ttgtcaaata 


agaaaattgg 


gtaaaaccat 


13200 


ctcaatgttc 


tgcattctcc 


tggtgtgtca 


13260 


tactgtgcaa 


ttagactgaa 


tcttgcatta 


13320 


tacagaaaat 


tgctgggcag 


agctgttgac 


13380 


gattttagag 


ctgtcttgaa 


agctgtgggt 


13440 


ggaaggacaa 


agcatcctct 


cagactgtgg 


13500 


cttttgtgaa 


acataaagaa 


tcttggttat 


13560 


atctaaattc 


aagtgtcata 


gatttaaatg 


13620 


gataattaaa 


tgcaggaggt 


atcacaaaag 


13680 


agtggcaaaa 


agaaaaaata 


gaaaccacct 


13740 


gaatctagag 


acctccattt 


atttaaatat 


13800 


tgcttaaaat 


aattgtagcc 


ttttaaccat 


13860 


ttgtttgttt 


tttaattgag 


aaagagtttt 


13920 


tcggcttact 


gcaaccactg 


tcttccaggt 


13980 


gtagctggga 


ttacaggcac 


tcaccatcat 


14040 


agacggggtt 


tcaccatgtt 


ggccacgctg 


14100 


ccgcctcggc 


ctcccaaagt 


gctaggatta 


14160 


catttttagg 


tgtacagttc 


agtggcatta 


14220 


caccacccaa 


ctccagaact 


ttcttgtctt 


14280 


ataactttcc 


att cccc tct 


CCCQCCaQCC 


14340 


tttgactctt 


ctaggtgcct 


cacataagta 


14400 


ggattatttc 


acttgccata 


ttgtcttcaa 


14460 


tctatcattt 


gcttttaaat 


ttaggctttt 


14520 
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tattatttct attcatcggt caaagatgtc 
gttataaaat ggcatggata gaatgactat 

5 

atataaagag gagagagata aagtgacaga 
taggaaacaa gagtgatcgg gtggcagttg 
10 caaactgtat tcccttgtaa tggtggtact 

taagaaataa caaaggaaaa taaatgttca 
tcaattaaac tttcgagtcc tagcttcagg 

15 

ggtatgagat tcgaaaaata aagaaaggaa 
tagtaggaag gagtctagtt ggtatatttg 
20 gccaagtttc tacaacatga aagcaagttg 

aagatcagca atgtccctgg gagaggatat 
caatagggga ggaagaatag agaggatgag 

25 

acctacaggg atatagaaga gctctggtgt 
ctacctgcta gataagcaat cttgcacaag 
30 tatcttcaaa gttcagcacc catctctgag 

tcttaataga atgccacatt catagtacgt 
ttatcatgaa ctactgtttg attctggaac 

35 

tccttctctg gctttaaaga tgtgctggaa 
gcttcaacat tattcccctc ataggacatg 
40 aaaattccaa gaaatgaaaa ctatctcctt 

ttcagataaa gtatctcatg cccggagagc 
gagtagaaat aaaatggagg tccaaacatg 

45 

tcagagagtt acaacaaaag acactcacgt 
aggatagtgt cattatcctt attttacaga 
50 atcctgccta cggccagctc atcaatatca 

actgatatgt tttgggagtt tatcacatgc 
catcttatcc aacctgtacc acaaacctat 

55 

gtttagcagt aactggtgga tatgggatct 
tcagatgtgc aactagatgc ctaggcattg 
60 cctcactagg aagagattct aggtgtcatt 



tacagaataa 


tgttgagtac 


aaaaagaaca 


1458 0 


ttccgtagaa 


ttgtgaatgt 


ctgtgaacat 


mm -\ _+m* m mm. 

14640 


tgggaagaga 


gaatgtaaag 


aatatgtgca 


14700 


aaagttaagg 


agctaatttg 


ttttcttctt 


14760 


ttatgcactc 


tttcaaacag 


aacaaacagt 


14820 


tggtatcaga 


taggctatag 


cctgaaagta 


14880 


ctgagaatca 


agcattttct 


gcagtcctta 


14940 


cccagaactc 


aggaggctgg 


tgacgaagac 


15000 


ggctattgag 


ccacacagag 


caaggtaaaa 


15060 


ggaaaaagaa 


atcttcaacc 


tatattccct 


15120 


ggaactgatg 


agtggcagcc 


aatgcttcaa 


15180 


aatcagttgt 


agctagccgg 


gcaagaacaa 


mm w— mm- jk my. 

1524 0 


tagatggcct 


gtgcccaaat 


cttgcctcca 


mmw mm. j-m gmm 

15300 


ctatttcacc 


tctccaagcc 


tcatttttct 


15360 


actgcagatt 


aaatgagact 


gtcacaaagc 


15420 


gcccaatcaa 


tgttagcaat 


gattatcatt 


15480 


ctagaatttc 


tctaagcctg 


acactttttc 


15540 


ggctcaccct 


gactccactg 


cagggattca 


mm mm. *m- mm. mm 

15600 


taacctcacc 


cacacttcta 


cccagatttc 


15660 


aaacaaagag 


aactttaggt 


ttaataattc 


15720 


atgagaaact 


agactaaggc 


acagatctag 


mm m-mm mmmm mm. 

15780 


gatcgactat 


tatggaacac 


ctgataaaac 


m* m-m m*. - _mm 

15840 


gtgtcattca 


aatcctcaca 


agagtccttc 


-m mm. mm, mm f\ 

15900 


taaggaaact 


ggggatgaat 


gagggaaagt 


15960 


gaactgggct 


tccaatgaat 


attagataaa 


1602 0 


ctccccctat 


gctgagtcca 


ttatatgtgt 


16080 


gaagtggtga 


aaataccttg 


cccagggtga 


16140 


gaacacacga 


ctggactcca 


agacatatgg 


16200 


gactcccatc 


ccagtgcact 


tttccatctc 


16260 


tccaattttt 


acatgttctt 


cactccaccc 


16320 
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aagagaggcc 


agaaccattc 


ctaaagcacc 


aacacattga 


ggcaaaatgt 


caggcagcag 




agctgtggca 


cctttcaagg 


atgacaagcc 


tttaactaaa 


ctctggcctt 


cagtagggag 


5 


aatgttgagc 


tgaaaacaaa 


tctgggcaac 


atttcaggtg 


ctgagccttg 


ttgctgacac 




cacagaggcc 


cagcaatgct 


gctgagttag 


tggtggcaga 


caacttgctc 


ctggaatcta 


10 


caagggaact 


ggaaatgtta 


ggaaggaaga 


aaggaaggaa 


gaggaagagg 


aggagaatta 


acatggcata 


gagaaaaaaa 


tatcacaagc 


gtctgagcca 


gacaagtctc 


aatccagtca 




tgtaaacaca 


ggctgttaat 


atttgctacc 


atatattgaa 


cacattctaa 


gtgccaggct 


15 


aagtacttta 


tgtgactatt 


tcattcaata 


aaagtcactt 


gagcttatta 


gccccaggtt 




tctaatgtat 


aaaattgatt 


taataacatg 


taagttacag 


ggttgttata 


aggattagag 


20 


ctgcaatggt 


ctaatgcttg 


gaattcagca 


ggagttaagt 


aaatgtttac 


attactaatt 




acgaagccaa 


tataagcaag 


gaggaagatt 


ccggatggga 


caggccatct 


cgctatagga 




aaggaaagtg 


gaacagcatt 


catcctcaac 


atttttacga 


agacaaaatg 


aagactggag 


25 


tagaagactg 


atcagtgcag 


gtgtagcata 


aaagtgtaat 


cctggaagat 


gtggtgtgag 




aaggtagcac 


aagtgaagca 


gagatacagg 


agatagggaa 


gggaagctgg 


aagcagaggt 




cactggaggg 


agagggagat 


ggacacattc 


agggctacaa 


agcaagttct 


atgtgatttg 




ctcacctctc 


aattgtggga 


cccctcaaaa 


tgtgtacagt 


actctcccag 


tgacatgctt 




cttgaccaca 


atggatgaac 


tgtgcccagc 


atgcccactt 


tccaatgctc 


cactgatccc 


35 


catgttttgt 


ttctgaagga 


caaccagcct 


tggaataatg 


gcaaatacct 


tcttaagttc 




ctacaaagta 


tggtccctgg 


gaagtttatg 


tctgtaagtc 


aaaccttggg 


aagtaactga 


a n 
4U 


gttttgatgc 


ctcttccagc 


atcatcagca 


tcatgctatt 


acaatcccaa 


accatggggg 




tttctcacag 


ctttacacca 


aagggcatca 


ctatccctca 


aagagagaaa 


cctggacaca 




tgtaccaaaa 


cgaagattac 


ctgcagaacg 


ggctgccaac 


agaaaccacc 


gttcttgggg 


45 


taagtccacc 


tcattataag 


gggaatactg 


agaaaatact 


ttgtaaatgt 


atctagacta 




cagatctaca 


gtgggtctgg 


gaaactcttt 


cactcttttc 


tggctgactc 


caggctaatg 


50 


aggacatgaa 


atggagcttt 


aagggaagac 


tcttaaagct 


ctgcagcagt 


ttctagaaac 




taaaaccact 


agatcttttt 


cactagcatg 


cttcacaatt 


aacaggttag 


tctttcttat 




tttatttatt 


tatatattta 


tttattttga 


gacagaatct 


ccctctgtca 


cccagcatgg 


55 


aqtqcaqtqq 


cacqatctca 


gctcactqca 


acctccgcct 


CCtqqqttca 


agtgattctt 




ctgcctcagc 


ctcccgagta 


gctgggacta 


tgggtgcacg 


ccaccacgcc 


cggctaattt 


60 


ttgtattctt 


agtagagatg 


gggtttcacc 


atactggcca 


ggctggtctt 


gaactcctga 




ccttgtgatc 


tgcccgcctt 


ggcctcccaa 


agtgctggga 


ttacaggcat 


gagccactgt 
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gcccagccac aaattagtct ttcttatttt 
accatatgga ttttactcaa tatgattaaa 

5 

gagatgagac gatcaagact aatctcatga 
taagacatcc aagtgtccca gatcatcatc 
10 gttttgaata ccctcttata atacccaggt 

ctgccctgtt tctttacaat ctattcttag 
aaagttagat catgtattct cctggctcaa 

15 

aaatgaaaag caaagttggt aaaacatagc 
ctccagtact tcacaagcct catgtcctac 
20 atggaacttg tctgtacctt tacagacaag 

agtgattttc atttgcttca ttccctcgtg 
caataactgt tgatgatgtg aatggttgac 

25 

atctttcaat tgcacatgca ttaaactatg 
cagctaaact tcagttcttc cacttgcgaa 
30 aaataagatg aattacagca gaagagatga 

tcttcattaa cactcctgtg aaaagtttgc 
ttatctactt ttcaagaaaa taaatttaaa 

35 

aaaatgtgtg tatgatcact ttccgatttg 
ggttagctat acatgacttt ctctagaagc 
40 tctttaatat aacaaattat aatgtatttt 

aataatcact tttatctgtt tgaagtgcag 
tgtccagatc ctgtgttgcc tgttgatttc 

45 

ctacccctcc cacttcaatc cagcaatttc 
aggagctctg tgtgtgagta gaatggggac 
50 accaggtgca tagtcgtgga gtgactcaga 

aggcctcgac tttccttcta tcattgcccc 
gcagagagac ttaatctgtc ttcctgcaga 

55 

gggtatacca gcacagaatt tcttacctgg 
attttccata tccctctttg aggctttgat 
60 taatcaactt ttcaggtaga gagtgagaag 



gtcattagta cagtaaaaat tttctattta 18240 

ttaactttga gcttattttt caaatatttt 18300 

cccatttttc cccgttcata tgagtaacta 18360 

cctcatatcc agaactttaa tctccatgtg 18420 

cctattcact gcctatttag atggatctcc 18480 

tacagcaggt agggtactct tttaaaaatg 18540 

aattatccag tgggattcct tttttgctca 18600 

ctaaaaatcc ctacatagtt tgtaccatgc 18660 

tactttgctc ttaactctat tctgccttgc 18720 

acatagtaca catataagct ctccaatggt 18780 

cctggcacac aacatacctg ttgtggtatt 18840 

ttcttttttt atttctccta tgtttgcatc 18900 

agcaaaagca tgggctttag agcaagctaa 18960 

ataattatta ttaaaatcca atgatgaaca 19020 

cctgctctat ctaatttcaa ctgaaaaatc 19080 

tctctatttc cacctcactg gacaactttt 19140 

aactgtcaag tatctttttc aaaaaaagtt 19200 

ctggtttaat aatcctatca gaaaagaaga 19260 

atatcgattg gctccttgta acatttttcc 19320 

atactcttta tactttaccg tgtgcattca 19380 

tctagatttc ctgtcttctc tcttgtagac 19440 

aagtctgggg gccatcttgg tttttgctcc 19500 

caccactttg atgtctgggt acccattttt 19560 

tctaagaggg gacatgcttt ataaatgcta 19620 

ctccaagagg ccaggaggca aaaggcggca 19680 

acgtcacatt tcccctattc aagaagagca 19740 

gcagggggca agagggaatg gtaatacaat 19800 

cattacagca tacctctaga aaagtggatg 19860 

gaggccattg gattccctcc attgtaagaa 19920 

gccattttta aacaataaaa cgagcaccac 19980 
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tggttgatgc 


cacaggtttg 


cagagatttg 


gattaacaag 


ggtaagattc 


atgggaggca 


20040 


gggacatgct 


tgctgtgtgt 


ccagaggtaa 


ccctgagcac 


tagagaaact 


ctctttgctg 


20100 


ccttccacct 


aatggctcat 


gaaaaagggc 


ccagtagatc 


catttttcag 


agagtggcca 


20160 


gggcagactc 


acacttccca 


caaagggtta 


tttcttctcc 


acgtagtatt 


tggtacaaag 


20220 


gataaaaatc 


aaatattctg 


ggcaaatgaa 


ctatactaag 


gtaccacttc 


tgtgtggata 


20280 


cagtccctgt 


gtgaacacaa 


actgtgcagt 


cacataagga 


ggccctggtc 


ctgccccaga 


20340 


gcacaatttg 


tccttgactt 


ctcctgtaac 


cagtggttgt 


gggccccaca 


tctccttaga 


20400 


gaagctagcg 


ttagctcata 


ctggaccctc 


tttgcctgtg 


attattcaat 


atccaagcca 


20460 


cacacttaag 


caacttttca 


gcttctcaga 


atacccttcc 


tctgccagct 


ggatctttaa 


20520 


taccactgag 


attaaatact 


gatttctggg 


gctttatgat 


aaagacttca 


cgtctgagcc 


20580 


tgctcttccc 


actttcctcg 


actcccattc 


tgactcaagc 


aacatacacc 


aataacttcc 


20640 


agtccatgtg 


tttctgcacc 


ttgtcccatc 


agagcctctt 


tcctgagtgc 


ttctgttggc 


20700 


tgttccaatc 


tctctgtcca 


ttcaagtcat 


tggtccatct 


caacctcctc 


tctttcaacc 


20760 


ctgtaattta 


gtaacactct 


tctccttccc 


aggtacacct 


tttctccagg 


cctcctcagt 


20820 


tcctagtgat 


ttccactaaa 


acaaatccaa 


cccctctgtg 


catctgcatg 


tcattcaaag 


20880 


tcaatataga 


cattcttata 


tcaggtggga 


attaggccta 


tctagcaata 


tagatccaca 


20940 


agatttctgc 


agtgttctct 


ccctaaaagt 


tttccaacat 


cactttttga 


ttgagtgatt 


21000 


gacagtgaca 


tggtgggcac 


atttccacaa 


gacaactgaa 


ctgatcattt 


ctctctcttc 


21060 


tgggcagttt 


ggcattactg 


gatccctctc 


aattatctct 


ggaaaacaat 


caactaagcc 


21120 


ctttgtaagt 


ataaggacca 


tcaaatctca 


gctggttgga 


attgaaggaa 


cgtcagagtt 


21180 


aattcttcta 


gtttaaaaat 


ccctatttta 


caatttagag 


agaaaaagct 


tagaacaggc 


21240 


taggaagccc 


aggttttcct 


ggctgtccca 


ttggtagaac 


caagagtaga 


gcccaaattt 


21300 


cttgccttct 


agaccagtgc 


tctttcgtct 


acacaatgtt 


gctcctggat 


tatattccaa 


21360 


gcttggagga 


ggtttacctc 


agttatttaa 


aaagtgtaca 


tttttaaatg 


taccatggac 


21420 


tctttatagc 


ttgtaacctt 


gaccttgaat 


aaggtagcca 


cccatcccag 


agagagccta 


21480 


cagatccttt 


tttctataaa 


acctgcctct 


tggggttctt 


tttgatgcca 


aatgtcccca 


21540 


ttcatagagt 


tgggacttca 


cacaaattct 


agcccttcag 


cgtcagggtg 


accaattgtt 


21600 


tQQt tt tCCC 


aqcractcrtct 


cagttttagc 


ccaqaaagcc 


tatatcctgg 


gaaacccctt 


21660 


actccccagg 


caaagcagaa 


taactggttt 


accctactct 


gtgaccgtgg 


taacaagttg 

* 


21720 


aaagaagtcc 


atgatatatt 


ttcaaagttc 


tactatttaa 


gcaactgaaa 


taaacttctt 


21780 


tcagtcaaat 


atcaggtggg 


gaggcattca 


ttcccctgtc 


agaagagggt 


gttgccgtct 


21840 
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tcatccaggc 


tggaggagac 


tatttccatc 


5 


gagaaaaaaa 


atcagtttaa 


agtaaggtgg 


gggggatgtc 


tgcaatgtca 


gcactgattg 








t* na c t t~ rr t~ r* 
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10 


gtgcatctag 


aatatgacct 


ttggtccctc 




aaaagagcgg 


cttcagagaa 


aagacaaaag 


15 


aggtggatca 


ggcctatgag 


aaccacattc 




ttttaggaga 


caggattctc 


ccatagcatg 






V-' ^ Ci y L. U 


aacRcatcict 


20 


tgtatgttgg 


gaaggatggt 


tgtcctgccc 




ttcatgtgct 


gagtctttga 


aggaaaagta 


25 


taccatatga 


gcaatggagc 


ccacatgttg 




ttgtcaagtc 


caagcataga 


tttccctttc 






CXCL^H CI 1 — > — u 1 — w 


Mrrarattt 


30 


gaaggacttg 


ctcaagcaca 


cagatggatc 




ataggaggag 


aggatttcag 


tttctaagtc 


35 


tgaactcttc 


aaaaccctct 


tccataatcc 




gagcatatac 


cagcctcatg 


caccatccat 






g.*H i~ w d y y ci> \~ 
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40 


aagagagaaa 


ggctgctacg 


gagagaagca 




cctccaaggt 


ttgtgggaaa 


agtggctaca 


45 


ggagagagaa 


gcacagggtc 


taagcccatg 




aaatcgaagg 


catatgcttg 


tcatattgaa 




y L L, Uy LLacl u 
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50 


ttccagtttg 


cctccacctt 


ttctctgaag 




ttcagtttac 


cagaggggac 


cgaagtccat 


55 


ttctaaggct 


attgaggaag 


tcatgggaga 




tatcttgagt 


gctatagaga 


gtcaatgggc 




catgcatttt 


cttattcaat 


aagcatttac 


60 


acaagagata 


caaagatgaa 


taaaacacag 
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caaagactca 


tgcttgataa 


ttggatgttt 


21900 


gagaaaatga 


gggtgatgga 


gtaggggtga 


21960 


cagcaccaaa 


ctcaaagaca 


ataccactca 


22020 


aacaacttta 


gaattattat 


attgecctaa 


22080 


cctagaaacc 


caatgtatgt 


cttcttactt 


22140 


caaactgaag 


aagttggaat 


acctacctac 


22200 


accacaccat 


ttcccaactc 


ctcttctaca 


22260 


acgattcact 


agcaaagaca 


ctgatgtgcc 


22320 


cctctctgtt 


taagcatcac 


tggaagacaa 


22380 


ttttctactc 


cagctctgta 


gctatatgga 


22440 


gtcccagtgg 


gcagtatctt 


tgcccaaagc 


22500 


ttctttgtcc 


cacttgtgga 


gcccacattg 


22560 


tgtgtattca 


gtgcatttgg 


gaagecaett 


22620 


tataaaagag 


ataactgaag 


cccagagtaa 


22680 


agtgacagat 


gggtcagtga 


cagagctggg 


22740 


cagaactatg 


actccaacca 


atgaatgctg 


22800 


tgggttctag 


actgeagett 


caatcatcag 


22860 


gtgtcctcat 


ctgtgcctag 


ccaggccctg 


22920 


ctaaatcacc 


tctatgggtg 


tgtggcaggg 


22980 


aaggctttat 


agctgcttcc 


tccctttctc 


23040 


gtctgataat 


atttctgttt 


gtgaatgggt 


23100 


tgcctctgcc 


tcatgccctg 


tcacttagga 


23160 


caaatattga 


ttgtcaattc 


aagatcaatt 


23220 


ccacagacct 


atagataggt 


tttccaactt 


23280 


ctgctctttt 


atgtaaatca 


aagtcatgga 


23340 


attctagaag 


ccctgaaggc 


aacagaatga 


23400 


tgttggcagt 


gacctgagca 


gcactgtctc 


23460 


cagaacccct 


ctccctaagg 


aggaattatc 


23520 


taattatatc 


acatgattaa 


etaaatageg 


23580 


tttgtgccca 


cacaaggaaa 


caggaaggaa 


23640 
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acagacccaa ggaagcacca ttgcagtaca 
gattctgtga tgagaaccca atgcagagtt 
5 aaatgcagtg agttctgtta ctgcaggagc 

agccctgagg actgcctctc aacattgtgg 
ttattcggag tactattatc caatatatga 

10 

tttaacagtg agtagtttca gactgaatat 
ttgcatactc tctgcttttc tggcagtctc 
15 acagaaataa acaaaaactt gtggttggaa 

ggatgtacta tagaaagtca aaagaggaaa 
ccactggtct ttcttaatca caaacgttcc 

20 

acctttgttg tactcatcac aggcagaaat 
tcttcatttg aattatattt taatttggga 
25 gcccctaccc cttaaagatg cttgtcttct 

ctgtacggtg tttaagttaa tcttcaaatg 
aaaacttgaa ccttaagtcc aaatacatag 

30 

cagagagtgg tgggaaggga gacaacggat 
gggcagcatt ttcttggcac ctttctccca 
35 caagccttgc ttattcaatc ctataaatat 

ggccttttaa gaagaaagac ttcttcctct 
ccaaggtagt atctctcatt cccttggatc 

40 

gcaagctttt gcactcctgt gggtctgctc 
ctttggttca ggagcttgag gaagtggaag 
45 tagatagtgt catcctggtg taggacagac 

atattggggg tttaatttaa agaaaatgaa 
agacaactgg agtccaaaag aacttcctac 

50 

aagcagactc caaaggccta ctctttgtta 
atgaaatttt aaaaattaaa aaatatatag 
55 cagaaggaaa aacaaatcta agaagaaaga 

ctaaagccaa atcatcaaac aagagctgag 
tattggtatt agaggctagg atcacacagt 

60 

acagggccat aaagctcata tggagtaccc 
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atggacagag 


gactaatagt 


gtcacgtgct 


23700 


tctcttccag 


gacctgagca 


gcttgacctc 


23760 


aggcctcttc 


ctccttgctg 


acagcatggt 


23820 


ctcagaaatg 


gattatctat 


cctcattgcc 


23880 


aatcaaagat 


tgtctcctga 


ccagtgtcag 


23940 


cctgtcatgt 


gaaatctctg 


tgtctcccct 


24000 


tggttggtcg 


gtccgaatgc 


ctggagacag 


24060 


gaggcctgtc 


tgctgactgc 


atggtgtggg 


24120 


tcatgaagga 


cagtagggag 


tctatcccct 


24180 


aaccttaatg 


ggtttaactc 


ttaaagtaca 


24240 


attcttccta 


ctcatcacag 


gcagaaatat 


24300 


ttcttttggg 


ttcttttcca 


ggggttggat 


24360 


gatcatcagt 


gaattgagga 


ctagcagagg 


24420 


caagcctgta 


tgctacatta 


tgttaggcac 


24480 


ttattttgta 


ctttggtcta 


acagggctga 


24540 


ttgacttcta 


gaaaaaatgt 


caacctctct 


24600 


aacttgatga 


tcactacctg 


ttcgtggtga 


24660 


agtatcatgg 


tgttaaaacc 


ataaaccagt 


24720 


atgaatcctc 


ctagtaggag 


cacagggaac 


24780 


catctatctc 


acaaatctca 


gaaaatgtgt 


24840 


atgtgaccca 


gagattgtca 


aaagcaaccc 


24900 


agcacagaga 


cctgtctgga 


ggaaacccag 


24960 


atgtggtaaa 


gcaatggggg 


tgggagaggc 


25020 


tttcattacc 


ccaaactgct 


tcccattctg 


25080 


aatatgagag 


aagttgcaac 


atagcagact 


25140 


attaacacag 


caattacctg 


aacaaaagta 


25200 


ccaaagttgg 


aagaagaaag 


gaagggaggg 


25260 


tcracttacrqt 


tccaqaaqct 


aagaaggatc 


25320 


gccaaaatac 


caaggaagtc 


tgaataggaa 


25380 


tttaacatcc 


atgacaagat 


ttggcattgg 


25440 


tgcaaaaatg 


ttgaaagctg 


atactgaata 


25500 
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tttactcaat ttgtgaaaat aatataagga 
agaaaacctt ccgtcagctc actaagcttt 

5 

catataaaat agggggacct caaataaaag 
gaaacttcta aggttctagc agaatcaaat 
10 aacccaactt ataagatgcc tccctatcaa 

tgaagttatc taataaccaa aatattatga 
agcggcagca aatggggaaa catgaaagtc 

15 

atttgaacaa ggctattatg aaaatacatt 
agaaaccagg ccaggtgcag tggctcacaa 
20 cgggtggatc acccgaggcc aggagtttga 

tctctactaa aaatacaaaa attagtcagg 
cttgggaggc tgaggcaggg gaatcacttg 

25 

agatcacacc attgcactcc agactgggtg 
aaaaagtaat agaaaccaaa atggaaaaaa 
30 atgtgcattt gagaaaaaaa tgaaatcaca 

aaaaactgag cagatcagtt aaacagcaga 
atggaattta gatttgatta aatcacacag 

35 

atggatgaaa gcctaaaagg tatgatggat 
tagtaggagt cccagtgaga gattataaaa 
40 aaatttttta gaatttaata aaaacatgag 

ctgcaggata cacagaatca aatacacacc 
atagtcaaaa agaaaataat aaaagcaacc 

45 

cattaggtgg gataccaaag ttctgagatg 
ttaagaaatc tgaaaaagaa agcagaaaag 
50 ctctgcattg ctggagcctc atcgccctgt 

gtcttgggta ccagccattc atgtcctgat 
ttcactgtgc tggagctctt attagctgca 

55 

tactccaaca accctggggt gagtatgctg 
gtccaggcta caataatcca acctcaaaaa 
60 tgaggtgcat gtggaaggcc acttttataa 
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aaattctgcc 


caacatccca 


agaagtttca 


25560 


gtgcaagtgt 


gccatctgaa 


ttcataatta 


25620 


cattaagtta 


aaaactgtta 


cgagatatgt 


25680 


ctaaaactac 


tatctaagga 


gatgtccgta 


25740 


agaaccctac 


cactaccacc 


atctacaaac 


25800 


gaaacataac 


aacagcacaa 


aaaaatgagt 


25860 


agaagctcaa 


gaattggaga 


taacataata 


2592 0 


ttaaatcatt 


aaagatgttt 


aaaaagtaat 


/"N. S~\ 

25980 


ctgtaattcc 


agcacattgg 


gaggctgagg 


26040 


gaccagcctg 


gccaacatgg 


tgaaatgctg 


26100 


gagggtggta 


catgcctgta 


atcctagcta 


26160 


aactcgggag 


gcagaggttg 


cagtgagctg 


2622 0 


acagaatgag 


accctgtcta 


aaaaaaaaaa 


26280 


aaaaaaagga 


cagtaaagga 


aaaaaaagat 


26340 


tacttggaaa 


tgaaaattgt 


ctttgaaatt 


26400 


ttcaatatat 


tcaaggaaac 


aaagagtgaa 


26460 


aacacaacac 


attatgtgtg 


tttggaaaag 


2652 0 


gcagggaagt 


ggagaggttc 


agtacatatc 


26580 


gagagagtat 


atgcaaaagt 


aaataactag 


2664 0 


atttcagatt 


gaaggaatgc 


attgagacca 


26700 


tagaaatatc 


atcatgacaa 


taaaaacacc 


26760 


agagataagt 


ggcagattac 


tctcatctat 


26820 


atgccgttgc 


tgcttatttc 


ccttcatcta 


26880 


aggattaagc 


aattttccct 


gtctcctttt 


26940 


ctcctcttag 


ccaggcacaa 


gggctagctt 


27000 


ttgcagggtg 


tcctagtggt 


gatgctcatc 


27060 


tacagttctg 


tcttttggtg 


gaaacagctc 


27120 


acatgtcgcc 


tgatacctgc 


tgtgtctcag 


27180 


gtggcaaaaa 


gaagaatcaa 


ttattgttca 


27240 


ttaaaaaaat 


gagtttaaca 


gtgaaacccc 


27300 
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atctctacta aaaatatgaa aaactagcca 
tagttgggag gctgaggcag gagaattgct 
5 tgagatcaca ccactgcact gcagcctggg 

aaaagagttt gtataaatgg gctccttctg 
gtgtctactg ggaagcagac cagttagaga 

10 

aaatcacata gcccaacccc tcatgaaaga 
tgaatgacac agactattgg gctgagaatg 
15 acacagctac ctgcctcccg agagagtctc 

gactttacta gaatgtctct gtcctgttta 
tagctcttga tgttataatt ttaaaatatt 

20 

agaaatgttt atagaatttg gatgaattat 
aaataattag cagactggag aattgcatta 
25 tatggcaaag ttggaagaac tagtactaca 

tcttttttaa tctccgaaat caaagtttaa 
ttcattttcc tcgacccagt cacaagatca 

30 

gtcttggata taagtaactc ttggcctcag 
caagtgtgat tagactttcc tgaaatctct 
35 aaaaaaaaag cttttgtttt gtatttgttt 

aataatttcc tcaaagccca agtcaataaa 
aaactttata aactgctttg ggtaaactga 

40 

gcactcatgc tagtgtgaat ttggtaagtc 
tttcacactc atattgctta aatattacat 
45 aattaaagac tatatataat taaactactc 

actgggccct ttttcatgag ccattaggtg 
taggattacc tctcaacaca cagcaagcat 

50 

aatccaaatc tctgcaatcc tgtggcacca 
tggccttctt ggctgggcgc ggtggctcac 
55 ggtggacgga tcacccgagg tcaggagttt 

tgtctctact aaaaatacaa aaaattagcc 
ctacttggga ggctgaggca gaagaatcac 

60 

cggagactgc accactgcat tccagcccgg 
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ggtgcagtgg 


cacacgcctg 


tagtcccagc 


27360 


tgaacccagg 


aagtggaggt 


tgcagtgagc 


27420 


tgacagagtg 


agactcagtc 


tcaaaaaaaa 


27480 


gaggacactc 


tggtcatctg 


ggatcagctg 


27540 


attgcttaat 


atgaagctag 


ttggttttag 


27600 


accacggctt 


ggataaaatt 


acatagaaaa 


27660 


agaacgcgta 


tcccctcact 


tttatcggga 


27720 


cctagccaag 


tttccttgtt 


aatgtgatga 


27780 


tgcttccctg 


tgctcttcct 


tacatcccct 


27840 


ttcatttccc 


ctaaaggcta 


caacaacaaa 


27900 


gatttaacta 


tgatcctatt 


taaaatagta 


27960 


tttttaatta 


atattgctgc 


aattacatat 


28020 


caaaacttgg 


ggaaaaggta 


ttatgagtta 


28080 


aaattctaac 


caattatgta 


ttttctagag 


28140 


tatccaacag 


gtcaaaaaga 


gttcttcacg 


28200 


aggaaggaaa 


agcaactcaa 


cactcatggt 


28260 


gccattttag 


atactgtgaa 


acaaactaaa 


28320 


actatgagtc 


gttatttaat 


ttctcttgaa 


28380 


tgttatcagc 


cagtcttcca 


aaatggtcat 


28440 


gcagaaggtg 


atacacagaa 


gggaaaatgt 


28500 


gcgtgactct 


gcaggctgtt 


tctgtattat 


28560 


attagggatt 


gtaagaaaac 


tttaattaaa 


28620 


tgccctggac 


actctctgag 


aaacagatct 


28680 


gaagacagca 


aagagatctt 


ctcaagtgct 


28740 


gtccctgcct 


cccatgaatc 


ttaccatgta 


28800 


accagtggtg 


ctcattctat 


tgtttaaaaa 


28860 


tcctgtaatc 


ccagcacttt 


ggtaggccga 


28920 


aaaaccacrcc 


tqqccaacaa 


QQtcraaaccc 


28980 


aggcatggtg 


gtgcgcacct 


gtagttccag 


29040 


ttgaacccgg 


gaaggggagg 


ttgcagtgag 


29100 


cctacaagaa 


caaaactcca 


tctcaaaata 


29160 
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aataaaaaaa taaataaaaa taaaaataga 
aaataaatga ataaaataaa ataaaaaaga 

5 

ctcttctgtc atttcagggt aggagtcaat 
taatctctga ctggcatgtt ttctgcttct 
10 aagaataatg tctatttctt tgtttgggta 

tgtaattcag atcatcaccc ctgcactttg 
cattgctgca attatcttgt ctgttctttg 

15 

aaactcgatt tcttgggcaa ttatggaaat 
ctctatgcat atgaggtttc ctaaataaat 
20 cttataactt tctacaacaa agaattattg 

ataccctgct atcgatggtc gctacaaacc 
ttggttttag ttttatgaaa caatttacct 

25 

tttaggaata taaaggatca atatgggaag 
taattagtat ttgtgtagtt caaaccagga 
30 ctttatgaaa gctaaaatat tctctgttat 

tctttactac tgattttttt ttgtttaatt 
aaatgttctc actacaaaaa aagaaaaaag 

35 

tagctggatt gtggtaatca ttttggaatg 
taaatatata taatttttat ttgtcaaata 
40 ttcaaagtta tctactgatt tttataatac 

aattcatcaa gtaaaagaaa ctcctttcaa 
aacatagatt ttggagacag gtcaagttga 

45 

gggtgacata accttgctgt tcctcagttt 
gttagtgcat gtcaacacct tctacacagt 
50 ttagttcctt tataactcat gaacatatgt 

tctttgttta ccatatgttt gtaagcaaat 
acttttcacc aggaagatgg ggggctgagc 

55 

aactatgatt ctgagattct actactgagt 
gctccatgtg actctggtgg agaggtagat 
60 tttcccttac tagagttctg ccatagaatc 
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ctatatatag 


tctttgaatt 


catt tatttt 


29220 


atattcagtc 


gttggcacat 


aaactatgtt 


29280 


gaagacatct 


tagtcttaac 


aagttacaga 


2 9340 


agcttggagc 


taaatgctgc 


tcatgctgaa 


2 9400 


ttagctctat 


tttgatctat 


agcacagttt 


29460 


taccacatgg 


gcgtattctt 


tcttcccagt 


2 9520 


tgtaatatgt 


tctgtgtgag 


cctctgcatt 


29580 


tccagtgtgg 


ctgcagttta 


actttgcact 


2 9640 


gaggagtagc 


atagtttaaa 


atatatatat 


29700 


agtccaaatg 


tcatcagtgc 


tcattttgag 


29760 


aggaaatact 


caagttatta 


tgtgtataca 


29820 


tcatgatctc 


atagttaaaa 


ttgtaataaa 


29880 


caaaatttct 


aaaggcagtt 


tctgttgttt 


29940 


aggatttgac 


tatcattaga 


ttttgcttaa 


30000 


aaaggggcaa 


ctccatctgg 


tcctatagca 


30060 


tgaaaatgca 


aagaattgtt 


aaatgttctt 


3 0120 


ataactacgt 


gaggtgatgg 


atatgttaat 


3 0180 


tatatgtata 


tcaaaacatg 


tagtacaccc 


3 0240 


tacctcaata 


aagatggaaa 


aaaatcgaaa 


30300 


agagaaatat 


ggttaaacag 


taagttagaa 


3 0360 


aggagggcag 


aagcgtggct 


gagtttaaaa 


3 0420 


gtttgcccct 


tccctgtcct 


gtatgtcttt 


3 0480 


cagtattgta 


gaactgttaa 


aagaataaat 


30540 


tcccagaaca 


taaggaatat 


tccataagtt 


3 0600 


gtaaggactt 


gtttcgtata 


taccatctat 


3 0660 


tgacaagaga 


gtaagtcttc 


tggatacagt 


30720 


atgctctaga 


attctaaaac 


tctgtgactc 


30780 


agagtcatca 


ctaagggctc 


atctctgagg 


30840 


catgatttgg 


gcggcaatgt 


ttgctcactc 


30900 


atggagtcaa 


catcccagga 


cagaagggca 


30960 
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actcacgtca 


tcactataaa 


accaaacgaa 


actgtattga 


ctgcatttcc 


ctacagacct . 


31020 


catagctctc 


tgctggattt 


tctgaaggga 


gagccaagag 


tcttgggggt 


aagtcctctc 


31080 


cagtcaatag 


gtcttccaga 


aggggagaaa 


agaagtttat 


ttattcatgt 


atgtctgcag 


31140 


agatatgcag 


agattatttt 


cagggaggga 


gttaattcta 


cttttccccc 


ttcaccttac 


31200 


atgagaaatt 


agtactgatc 


tttgaatccc 


agcagagggg 


gaagcaagaa 


actagtataa 


31260 


ctttttaaaa 


tatctcattg 


aggaagtgac 


ttcagggtgt 


ggtggcacct 


cccttgaagc 


31320 


taagaaacca 


taagtggaga 


ggtttgactt 


ccccttcatg 


ctcttaaagg 


aattgggagt 


31380 


gcttcaaaac 


ccatattcaa 


agctccacag 


tgaaccaaat 


gataaggcat 


gcaaaacgtt 


31440 


tttcatcaca 


aatgcaccca 


ggggagagct 


gtatgcacag 


cctccatcct 


ttgtacaaca 


31500 


gcaaaaatca 


ttacttggtg 


tgccacagcc 


cagagatgtg 


gcctggggac 


aaaatctaga 


31560 


ctgccagaga 


agttgcgccc 


tgagaaccac 


tgctgctatt 


cttttccctg 


cagattccta 


31620 


caagctgata 


ggcccactgg 


gctattctca 


tccctagttg 


ggtccatgga 


tgaaggggag 


31680 


gtgaaatgca 


agctcatccc 


aggttctcca 


ctctatgtca 


tcaaactttt 


taaattttta 


31740 


gacaactggg 


atgagcccca 


attccaaaag 


cctatactga 


gtgctggaaa 


atgtgggggt 


31800 


gggggctgtt 


aggataataa 


taggatactg 


gtgagacaga 


cgaaaaaaca 


aatgtaaagg 


31860 


tgaatagttt 


atagtccctg 


tgctcaagaa 

* 


gcttataatc 


tgggggtagt 


cagacatgta 


31920 


ctcaaatctg 


tctgatataa 


gattaattgg 


taacaccagg 


gaagatagaa 


aataataaag 


31980 


aataaaagtg 


aaggatttga 


acctgaagga 


tccaggcaag 


caaagtggga 


aggaaacaca 


32040 


ttgaactgag 


tctcttaaca 


atctgagaat 


gattgacaaa 


tgagagagga 


aggcatttgg 


32100 


atggagggaa 


aggtgtgcca 


tggaggcaca 


ccctgaggga 


cgtggggtag 


cccacagggt 


32160 


cttggatact 


tgtaacctca 


tgtacccatt 


tctctcctca 


caggctaccc 


agatcctgct 


32220 


tgctctaatc 


attgtgggct 


ttggaactat 


atttgcactt 


aattacatcg 


gtttctccca 


32280 


aagacttccc 


cttgttgtcc 


tcacaggata 


tccattctgg 


ggagcactta 


ttgtgagtac 


32340 


tgtcgattag 


aagctttgga 


gaaattgcta 


taaagataga 


tttgaactgc 


ttcacgtcct 


32400 


agggtaatga 


attccataaa 


tatggccctc 


caggagacca 


aaaaaggcag 


ctcccttcac 


32460 


ggcaaaagaa 


gcaactccat 


ttggtcactg 


cactaccatc 


atcattgtaa 


taactactct 


32520 


caggtgctta 


aaatttgttt 


ccaaagccct 


tgtgtgtata 


gtaagacatt 


tgctcctcaa 


32580 


taccaacc tcr 


QQaQactacra 


fccrcacaaata 


tcctacttat 


tcctttqcaa 


ataaggaaan 


32640 


tgagctcaca 


agtagagaat 


aacttaaaca 


tatctagcct 


acctagcact 


catggaatca 


32700 


cagactctga 


aagcttaaaa 


agaattttaa 


aattattatc 


actcatcctc 


cttcttagcc 


32760 


ccagagacaa 


ctttatgagt 


ggcactcatc 


tagataatca 


tatagccata 


aatattattc 


32820 
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ttaatacagt gctatttcta tccaatcccc 
tatttaagga cagactcatt agagcttctc 

5 

cagtgttact gtctgtttta gttgttatat 
agtgtgcatg aacgaacgaa tgaatcaatc 
10 ttaataaatt aggccactat ctctgactat 

aaatactcca tgccttttct ataatgtgca 
tcccaatata aataatgtaa tattcaatat 

15 

aaaatcaaac agaccaaccc ctcactttct 
tttgtgtact ctgcatgtgt gaattaaaat 
20 tacttttcac ttaaaagata tttactgagc 

gctgagactg agaatactgg aaaacagctc 
gatttagaca gtttgtcatc taactgggag 

25 

aaatatttgt tcctacactt gtgctaagtg 
agggtttaat agaggactca cctgaccatg 
30 ctcactaggc gaagtggaag gaaagcatcc 

tgatgtggga aagagtaacc aaaagaagac 
ggctataaag gtagggtgta accaggtcgt 

35 

gaacttcaat gtgaaagcat gggaaagcca 
tgaagacttg ttgaagtatc attgtgtttt 
40 taaacctagg agaatccaat gagaggaggc 

cgaactagct tggtggcaat ggagctagag 
gaagtgacag gacttggcaa tccaatgtgg 

45 

actttctgat ctctgcaagt cagctgcaga 
aggtccaagg ccagttccca cagagaaggt 
50 gactccacat ggagtgacag agttctcatt 

tgggaaatga gggactctga taactacact 
gggtcttcct tgagaaagtc tcccttcacc 

55 

tcttcaaggc ccagagctca ggaatctctt 
tcctctggat cttagaggag cctgcctgtc 
60 agtgctgagc aaatccagca caaagtttct 
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acagcctgga 


ttcctaattt 


tgtgccataa 


O O O 

3 2 8 8 0 


ctactataac 


atacactcct 


gagaagacag 




tctgagaacc 


tagaatcatg 


cttgaggcat 




aatcaatcaa 


t caacaaata 


ggtgaaacaa 


O "3 f\ cz r\ 

5 ,5 0 o 0 


caaaaaccca 


cagtctcttt 


tgataaacag 




cttcagaaac 


aaccagcagt 


gttttcattt 


T3 n Oft 


_i_l_.k_.L__.___ 

gattttcaaa 


ctzatacatigc 


catactgtag 


"3 "3 *"5 A f\ 


y— 4 **> 4— 4— 4— y-_a y»«* 4^ A 

cattccacaa 


cagaattgtc 


ccagugcgac 




gagattctgg 


catgetgett 


ttgaaaactc 




acataccgcg 


tcttcaatgg 


y-^ 4 s y^i y^« 4^— ypw^w 4* 

catcctaggt 




4"* 4™ J*^_ 4^— yW*M y™^ 

ccccctgaga 


gaaguggega 


attgttctua 


1 *3 A Q n 


actagacatg 


aaaaagtcaa 


aagtaacccc 


j .3 b4 U 


ctctaagagg 


agagttccag 


1 y ■■ y 1 y «■ — • 4 /"r 4 " y — » 

9 t 9 c 9 at 9 t: 9 




g a ggtcaggc 


aagtgctttc 


4V— y ■ ^— m y>»v 44— yv y*^ 4- yr 

tgcgtagctg 




c tggcaaaga 


taaagcatga 


gecaaggget 


j j /ZD 


caacgaacct 


aucgtgacgg 


cataaaatga 


J J / OU 


gccaacccct 


atagtcatgt 


taagaaaatg 


"3 Q Q /I O 


ctgaacgtct 


ttaaagagga 


4-^ y"*r 4~" y"^C ^ y^ — ^ y^i ^— 

tag tgacact 


*3 *3 q n r\ 
,3 -5 y U U 


tagagugaga 


aattaattca 


attataacag 


3 3 n 


tacaaacaca 


cagatgagga 


— % y-^*- y»^r 4/— 4~ 4^" J mm % y^l 

aagagtttcc 


"3 /i n r> n 


auggagagcc 


ttgagagtga 


ccc gg a gg^ a 


•3/1 fion 


ggagugaagc 


ggagggaggc 


4— ^ » ****** 4— 

atCaaayy La 


J J. ft U 


tec u tggact 


ctaagaccca 


gatgtcccca 


-3 A o r* A 
J4ZUU 


ccccaucccc 


agaacaticaa 


aaaagtaagg 


j4ZbU 


_— — JU_» _P__* _— _ _— __» _— _— — 

ttcc cgaggt 


ggtactaggc 


tctacgatca 


J4 J<iO 


aga cat: cage 


yp*| 4- 4— y**"l 

ccccatatca 


gectttagag 


"3 A "3 Q n 

j 4 j o U 


tgctcagaac 


aggcacagct 


cctcctcttc 


34440 


tttggagccc 


cagccggatc 


aaacatctgg 


34500 


ccagttggcc 


ctggcagaga 


gcgcttgttc 


34560 


tcctacccta 


cctacgccta 


ccttccctac 


34620 
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tccctgatgc tcaggtggaa aaacgggatg 
aaatccaagg gaaattatac tgcaaggtgg 
5 ttgtcttctt aacagtttat tcttacagga 

cttctggtaa gccacttaaa ctacataaaa 
tttatgtatg aagcctttag taataaagtt 

10 

ttgcagttca gatcattcct atttgccagt 
cttgttccca tttccctggt caagtaaact 
15 atccctgtaa ttcattgttt gtaattaggg 

cagaaatatt ggacaacatc tgaagacatt 
caagtgagta gaggccaggg acatttctaa 

20 

caacacaaaa attatccaac aaaaaatgtt 
aaccaaaacc aatcctaagc tctcactaat 
25 taccaaccga ggcttttctt tagatcctgc 

aatgctggat tgttctagga ttgaaccaag 
ggaaaagtgc ctacttctaa gggttatttt 

30 

gcaagatgat gcttaaatgt tcatactcct 
t9999atgtg ggcctctctg aaagtcagtt 
35 tcctctaggg atagttacaa ggcatagaga 

catccctgct tatggaaata catgcagaca 
ctgtattgac ttccttccag agctgagaaa 

40 

actcactgag agctatgcag atggaggtgt 
aggcatcaga agacctggct tttcatcaca 
45 cacagtcact tacttcctta attccattta 

ttcctacagc acaaaagtat aatgagaatc 
aaagcccaat acacatgagg gattgattat 

50 

gggagccaaa atagagcctt gagaaaacaa 
atctatatgt tcctgtgcat tttctgccaa 
55 gaggtaaaca ttaagtcaca agtacacttc 

cagaaaggac tggctgtgta aaggcatcat 
atcaataaag gtacagcaaa tacctgatga 

60 

ctttaaaaga gtgatatcag tcctgggagt 
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gtcttggaga 


ttcacaaaca 


gatattttgc 


34680 


tcaaacacat 


cacctgttaa 


cttttgtctc 


34740 


tacctcacag 


taaccgataa 


gaaatcaaaa 


34800 


tttaaaatct 


tctatttgtt 


ttatatcatc 


34860 


acctttcagg 


aaaatctgta 


tgcatcaaag 


34920 


gccactgttt 


gcctgacaga 


cattcttaca 


34980 


aaaagagttg 


tttattggaa 


accattctgc 


35040 


gtaattccct 


gccctgcacc 


ccccaggacc 


35100 


tttggtcatt 


acaactggtg 


ttattggcaa 


35160 


gcatctaaaa 


tgcacaaggc 


agcctccccc 


35220 


tatagtacca 


aggttgagaa 


accttgctct 


35280 


tcttcctccc 


aatcctagcc 


ctatctccaa 


35340 


ccaaggtctg 


gaagaaagag 


tttgggcatc 


35400 


gctatcactc 


acttctgtgt 


taacctgaag 


35460 


gagagtttaa 


tgagacatta 


ctgataaaag 


35520 


ttctaaccac 


tctgatcttg 


tttaaggtcc 


35580 


gaactctgtt 


cttatggtcc 


tctgccagtt 


35640 


cattctgagg 


ggagggcaga 


taaatgtttc 


35700 


atgcagatga 


agccccaaat 


gctgttctcc 


35760 


taaactatgg 


gtaaggaaaa 


cctactcctg 


35820 


gcaggaatga 


tggaggcagg 


atgctgaacc 


35880 


gcccttccac 


aaaccgtgaa 


agtgattgct 


35940 


tctgtaaaat 


gtggataaca 


acaagtgtcc 


36000 


agaagagact 


aggctgggca 


tgttctttgt 


36060 


ctagagatct 


tagtctataa 


aatgtcctat 


36120 


ggggactaaa 


gaccaagaga 


gagatggcct 


36180 


agtgcctatt 


tgcaataaaa 


acctgggatt 


36240 


tqaqgtttqc 


cacaccaagc 


cacaaacttc 


36300 


tgtattttcc 


accacggtag 


tcataacaca 


36360 


atggaaatgt 


gaagcttttc 


tttgtttttc 


36420 


gggtaagaag 


gaagaaatca 


gatgaccccc 


36480 
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ttcctgtaaa 
aagaatgaat 
tcaagaaaaa 
ttctgttgat 
tgtcacctct 
ggtgcacatt 
gacacattca 
aggtcctagt 
acacattgta 
taaaatgaac 
gtttgtcttc 
cctgatggaa 
tacaagatat 
cagggtcaag 
actttcacca 
gaaatatgtg 
tccctaaaat 
taaatgaagt 
tacaatggtt 
ttggcctaat 
tgagaatgct 
cagttaaata 
tttactgttt 
caaaatacag 
ttaagacata 
tcttttacca 
gtgacattat 
caagtcctgg 
ttgaagaata 
ctcagtaaat 



ggaactcgga 
aaagggaatt 
acacctttaa 
tattgcttga 
tttgttcatt 
aaatatttat 
catttttaaa 
gaaacttgga 
ctgcttccag 
attttcttta 
tgactttaat 
caactcttcc 
ttttgctttg 
gtgtcacggg 
ttctcagcta 
ttttcagtag 
cttctctgaa 
tgtgtcagca 
aaggagcgat 
attgtattct 
tacctcatgg 
gtcagtaggt 
cagcattaat 
tgggggaagg 
cattatttca 
ggaaatgtag 
ttttaagcat 
ggttttattt 
tttgaatccg 
aagaaatgag 



tatacctacc 

gggtggggca 

actccagtcc 
attctcccag 
tttgtttctc 
gaaataaatg 
ggcagaaatt 
cttgaaatga 
attatgaaag 
gcattagcca 
gagttagaat 
ctgagaagta 
gtttcgatct 
catgaatgtt 
cagacatcaa 
aactcttttc 
ctgtgtccat 
tttgaactga 
attttctgga 
ctctaagctt 
gttgttgtga 
ttagctaata 
gaacaaaatg 
tatttagaag 
cataattgtg 
ttgtctgtgc 
gtcagtgact 
tattcctacc 
ttagaaggga 
aagttagttc 



ccacccaccc 
gtgtgaaata 
aaataggtta 
ggaatgactg 
cagggcctaa 
aataaatgga 
gagattccag 
ggtttcattt 
atggtggtga 
tagagcatcc 
ttagtttatt 
tctggaaaga 
tatttatttt 
atcagctcct 
gacaagtact 
attgtaagtg 
gatgtaatga 
cggttgagaa 
cttgaatcct 
cggttttcct 
ggattgctta 
caattattat 
atctaaaagt 
tcatttaaaa 
gcaattttat 
aatttaaagt 
ccaataggcc 
agctatatct 
aataaagtgg 
ctgaatacaa 



acacccccct 
tttttattgc 
acatctttaa 
taaaccacat 
tgcagtactc 
tactacccta 
gagattaggt 
ccaagcaaag 
taaaatgaga 
ttagcccatt 
aagccttcca 
taaaagcact 
atcattttta 
tggttgcgat 
gccagatgcc 
gtcttatttt 
ttcactgatt 
gggtaccatg 
gttcacacaa 
caagtggaaa 
aattaaaaca 
ggccatagat 
ttgctcaata 
aaataaggat 
tcattaaagc 
atttatctaa 
tttctctcta 
tgaagcatgg 
aatcacagaa 
gaggtaattt 



ccaaaaaaat 
aacaatgagt 
atgttaacga 
gattgcagga 
acatgtatct 
aaagttaggt 
tacttgccta 
atatttgcta 
taaaatggga 
ctaaaacttg 
atcatactga 
aaaaatttct 
aactatcttt 
aactgggatt 
atcctttgaa 
gcatgaagaa 
tttattttat 
gggttactgt 
gcttgtcacc 
atgaacataa 
gtaccttgca 
tgaattaaat 
acactgagat 
ttttataagc 
agatgagaga 
caagtaatca 
gttccctaag 
gttctataat 
aatggtcttc 
aatcctcaca 



36540 

36600 

36660 

36720 

36780 

36840 

36900 

36960 

37020 

37080 

37140 

37200 

37260 • 

37320 

37380 

37440 

37500 

37560 

37620 

37680 

37740 

37800 

37860 

37920 

37980 

38040 

38100 

38160 

38220 

38280 



-238 



WO 02/062946 PCT/USO 1/48437 



atcactcagt 


gaaatggtcc 


cagggactct 


ctcatgtgtt 


ctgagataaa 


ttctaaaatt 


38340 


agactcatga 


tagattacta 


aagaaagcca 


gaaaacaaca 


taagcgataa 


aaccctgtcc 


38400 


taactgggcc 


aagaacatga 


tacaacatta 


taattcttgg 


ccattgctga 


atggacagga 


38460 


ggcaactact 


gtttgttatg 


ttcgtatata 


cgaggaagaa 


cttgtatata 


cccgaaaggg 


38520 


ttaaatctcc 


agatatatga 


aattccgtat 


ttcctgctgt 


ccatggatgg 


gtattgcaag 


38580 


attagttacc 


tgcatacaga 


accatgaaca 


atctcttcct 


agttctcact 


tacataactt 


38640 


aacaatttct 


tgctgtaacc 


caaacatcat 


aagctaaaag 


agccagaaca 


gtgggcttta 


38700 


gatgtctctt 


gggctgaccc 


ctgtcctaaa 


gaccaatcta 


ttgtctatac 


tgtggctaga 


38760 


gtaatcattt 


aaaaatgtaa 


atctgagcca 


ttacttaatg 


gcttcctgta 


aaacgtatga 


38820 


tgattaagac 


ccaaatctta 


acagccttac 


caaaccctgt 


aacacctatc 


ctacctatct 


38880 


ttccaacatt 


attccaggac 


ttcatccttc 


tcactcctcc 


acactccagc 


cacttcctat 


38940 


aatcccttaa 


tgtgtcaagt 


atccacggcc 


ccaagatgat 


cagaggatca 


gcttcttccg 


39000 


gctttgaacc 


gtcaccaggc 


cagttataca 


attaactcat 


cctcatactt 


aaatctcaac 


39060 


tgaaatacca 


cttctttaag 


gaagttcttc 


ctgacctcac 


ctctatgctc 


atatctgcat 


39120 


agatccccca 


ttagagtagc 


gcatagctcc 


caatccttct 


tcacagcatt 


tatcacaatt 


39180 


tagttaggta 


aataattatc 


taattacttg 


ttattgtctg 


cctcccttac 


catgatgtaa 


39240 


gttctacaaa 


atccaagacc 


aggtatatct 


tgttcaatgg 


agaaagtcag 


taaattgcat 


39300 


acaatagggt 


cttcatacat 


ttcagtttta 


ccaacaaata 


ttgaatattt 


aagattttgg 


39360 


gtgtattgca 


gggaaaccaa 


gattcagaaa 


cacacagtta 


tttgttcaag 


gttaagtggg 


39420 


actttggacc 


cacttttttt 


tttttttgag 


atggagtttt 


gctcttgttg 


cccaggtttg 


39480 


gagtgcaatg 


gcacaatctc 


ggctcaccac 


aacctccgcc 


tcctaggttc 


aagcaattct 


39540 


cctgcctcag 


cctcccgagt 


agctgggatt 


acaggcatgc 


accaccaggc 


ccagctaatt 


39600 


ttgtattttt 


agtagagacg 


gggtttctcc 


atgttggtca 


ggctggtcgc 


aaactcccaa 


39660 


cctcagttaa 


tcctcctgcc 


ttggcctccc 


aaagtgctgg 


gattacaggc 


gtgagccacc 


39720 


gcgcccggcc 


aaccttggac 


tcactgtaac 


tcaggtcttt 


ctgtcttcat 


agtttaaact 


39780 


tttgagacac 


cataaggtac 


catgttactt 


ctcaatataa 


acattacctt 


aaagcaaaat 


39840 


gggcaaaatg 


gcaatctgtg 


ggctatatcc 


agcctgtata 


tatgttctat 


ctggctcttt 


39900 


agggaaaaac 


aaacaaaata 


qqcattatqq 


tatccatgtt 


cccacatgac 


agcaattggc 


39960 


tggaattgag 


taatggatgc 


ctcctatgaa 


tactattact 


ctctactctt 


tccctgttta 


40020 


gttcacaatg 


actgcccagc 


tcccatggac 


attggagttt 


gttatccctg 


cctcatcaga 


40080 


aagacacaag 


atgaaaattg 


tgcagtgtct 


gatggctaca 


gaaccatcca 


aaatatgggt 


40140 
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ttcccatact tggtgataga gtgtaagggg 
ctcatttcaa ataccattta tttctcctca 

5 

gatcatcagc atagcagagc tcagcatctc 
ctggacacag tcagatgagg tgtgttctga 
10 cctgttcctg ttgggatact gcacacaatg 

tgctcagatg ctgaaataaa acatctaaat 
attgaagctc accccaccat ctgaaacaac 

15 

atactcagtc atctctgtta agacatagct 
cagacttaac aggaacccaa attactgtga 
20 gtgtgacaca gttccaccac cacaaaagaa 

acatagtatt tataataatt gttatagatt 
ctgttataat cactgagcac caacttaatc 

25 

acatatgaag gaagtaaggc acaaagaggc 
caggattctg acccctgtga tatgtctcca 
30 acaggccttg tgatttatgg agatttcatt 

caactttata gaagtatcat ttataaacaa 
aatcaaggta tagtgcattc ctagcatccc 

35 

atgttctcat cacctagatt ggttttgact 
gtaagtactc tctgtatatg gcttatatgg 
40 gtcatccatg ctgttgcctc tatcaagaat 

tttgtttatc aatttacctt tttttttttt 
ctatgaccca ggctggaatg cagtggcact 

45 

gactcaagca acctttccac ctcagcctcc 
cacccccagc taaatttttt ttgtattttt 
50 caggctgctc tcaaactcct tggctcaagt 

gggattacag gtgtgcgcca ctgcgcccag 
gtttaaagtt ctgaactatt atcataagtt 

55 

tggacatatt ttcacttctc ctggagtgtt 
gtaaccagat gtttaacttt ataagaaaat 
60 cttttcattc tcatcaacaa tgtgtgagag 



ttctcagctt 


atatatttaa 


aataacagct 


40200 


ttatttgcag 


ggaattttgt 


taatcttact 


40260 


tgtgactatt 


gcatccttta 


gaagcaagtg 


40320 


tgtctcttaa 


tattattaag 


agaacaggtt 


40380 


atgtctgatt 


tattagtagg 


aaaagcagca 


40440 


ttaggaaggt 


cagagagggg 


caagcttaag 


40500 


tatgctttcc 


tttcttgtac 


tatctgcttt 


40560 


ccttatgtct 


ctgccaaaaa 


tatgtaatat 


40620 


catggggtgt 


atattcttgg 


atcataagat 


40680 


ataggaaaaa 


aagctgttac 


tactgatact 


40740 


atatttactg 


aattctgact 


atgtcagtca 


40800 


ttaacaaaac 


tattataaca 


tctttatttt 


40860 


taaactaacc 


tctccagtac 


cacataacat 


40920 


gagcctgctc 


actaagtccc 


aaataccagt 


40980 


tctctctcta 


gctctctctt 


ttttttacat 


41040 


caaaatgtac 


ctatgtttag 


ctaccatcac 


41100 


ggaaggttca 


ctggaacccc 


tttgtagtca 


41160 


ttccaaaatt 


tcatattcat 


ggaatcataa 


41220 


cttcttccac 


acagaaatat 


gtttttgaga 


41280 


tcattccctt 


tatgtatggg 


tacaccacaa 


41340 


tttttttttt 


tttggagtcg 


agacctcact 


41400 


gtctcggctc 


actgcagcct 


caatctcctg 


41460 


caaatagctg 


ggactgcaag 


catgcgccac 


41520 


tttgtagaaa 


tagggttttg 


ccacgttgcc 


41580 


gatacaccca 


ccatgacctt 


ccaaagttct 


41640 


tccatttacc 


ttttgatggg 


tatttagatt 


41700 


actaagaata 


tttgagttca 


tatatttgtg 


41760 


gcatctagga 


tttgattgcg 


agatcgtatg 


41820 


accaaacagt 


ttgtcaaagt 


ggctgtattg 


41880 


ggacagttgc 


tccatatcat 


taacacttga 


41940 
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aactgtcagt 


ttattaataa 


tttagctatt 


ttagtgggta 


tgtagtggca 


cctcattgta 


42000 


gttttaattt 


acatttccct 


aatgactaat 


gatgatgaca 


ctcttgctat 


atgtttatta 


42060 


tccatttgtg 


tatcttcttt 


tgagatgtct 


gttcaaatat 


ttagctgaat 


tttattggat 


42120 


tatcttctta 


ttggccttat 


agcagcctaa 


ttaattctta 


atacaaatcc 


tttacaaata 


42180 


ttttttcaca 


ctgtgacttg 


ccttttcatt 


ttcttaatag 


cttctttgaa 


aaaaacaaat 


42240 


ttgatgaagt 


tccatttatc 


atttttgctt 


ttatggttat 


ggtttttgtg 


ttctgtttaa 


42300 


caggttttcc 


ctacccccaa 


atcatgaagg 


ttttcttcta 


tcttttccta 


cagaagtttt 


42360 


ataattttag 


cttttgtgtt 


tagatgtatg 


gtctatctta 


actttcatgt 


acagttgtag 


42420 


ggtaaggtta 


gaagttaatg 


cacatccccc 


ctcatggaga 


tatccagttg 


ttaacgccat 


42480 


ttgttgaaaa 


tgcaatgctt 


tcctccatca 


ccttggtatt 


tttgttgaaa 


atcaatttac 


42540 


ttattattta 


tttgtaggtc 


tcttttggat 


actttttccc 


tctggccact 


acgtctatcc 


42600 


ttccacgtat 


tctacactct 


cttcatttct 


gtgtctttag 


agtaattctt 


ttgcattgtg 


42660 


gtgaaatata 


tataacataa 


tagttatcat 


tttagtcatt 


tgttcaaatc 


attgaataca 


42720 


ttaataatgt 


tgtgtaactc 


tcaccactat 


ctatgcccca 


aaatttttca 


tcatccccaa 


42780 


cacaaactct 


gtaccacaaa 


acaatcactc 


cccgattccc 


ctcctgattg 


cccctggtac 


42840 


cctctatcct 


actttctata 


ttatgaattt 


gccttttcta 


gagacctcat 


ataagtggaa 


42900 


tcttacaata 


tttgtccttc 


tctaactggc 


ttatttgagt 


aagcataata 


ttttcaagat 


42960 


taatccatgt 


tgtagcatat 


atcaaaattt 


cattccttct 


atgactaaat 


aaaattccat 


43020 


tttgtgtata 


taccatattt 


tgcttagcca 


ttaatttgtt 


gatggacaca 


gatatttctg 


43080 


ccttttgcat 


attgtgaata 


atgctgttac 


gaacattggt 


gtaaaaatat 


ctgctcaagt 


43140 


ccctgttttc 


aattcttttg 


gatacatacc 


taagagcgaa 


attgctggct 


tatatggagt 


43200 


tctgtgttta 


atgttttgag 


aaaatgccaa 


attgtttttc 


accttcgttg 


caccatctta 


43260 


catttccact 


agcagtgtac 


aaggattcca 


atttctccac 


atctccatca 


ataccaacca 


43320 


acaggggaaa 


agggggagta 


attattttcc 


atttattttt 


aattttcaat 


ttatgattga 


43380 


cacataattg 


tacatattta 


tggggtacag 


tgtgatgctt 


caatgcatgt 


tacattttct 


43440 


aatgatcaaa 


tcatggtaat 


taccacattc 


atcactttaa 


acatcatttc 


tttgtagtga 


43500 


tgacattcca 


aattttctct 


tctagtcacc 


tggaaatata 


cactacgttg 


taatttgcta 


43560 


tagtcatgct 


actaagtata 


gaacttattc 


ttactatcta 


tagtaagtaa 


gaacttactc 


43620 


ttcctgtcta 


gaattttata 


cccattgacc 


aatccttccc 


ggtcctccca 


tcctccctac 


43680 


cttccctagc 


ctctggtaac 


cactattcta 


gtcttcactt 


ccatgaagtc 


aactttttta 


43740 


gattccaaat 


gtgtgtgaga 


tcatgcagtg 


tttgtctttc 


tcagcctggc 


ttatttcact 


43800 
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taatataatg tcctccaggt taatgcatgt tgccacaaat gacagaatta cattctgttt 43 860 

tatggctgaa tagtattcta ttgtttgtgt gtgtgtgtac attatatata tgtaatatat 43920 

aatgtatatt ttatatatat atatattaca ttttctttat tcatttatgt gtagataggc 43980 

atttaggttg attccttatc ttggctcttg tgaatagcac tgcagtaaat atgggagtac 44040 

agatgtttct ccaactgtct tcatttcctt tgaatatata ccctgtaatg ggatttctgg 44100 

atcatatttg tttttatcat ggccatcttt tttctttctt tctttccttc tttccttctt 44160 

tctttctttc tttttctttc tttctttaat ttttatacaa agactatgtc acttaggcta 44220 

cagtggcaca agatcatatg tcacttaggc tacagtggca caagatcata gctcactgca 44280 

gcctcaaact cctgggctta aatgattctc ctgtttcagt ctcctgagta cctgggatga 44340 

aaggcatgtg ccaccacacc cacataaatt tttattttta tttttatttt tgtagagaca 44400 

aggtcttgct atgctgtcag ggctggtctt gaactcctgc ctttaagcca tccttccacc 44460 

ttggccttcc aaactgctgg gattaaaggc atgagtcacc acacccagcc ttatataatg 44520 

gccatcttaa tgggtgtgaa gtggtatctc atacttttga tttgctttta cctaatgatt 44580 

tgatgttgag catccatgta tacatcttct ttggagaaat aactgttata gtcttgcctg 44640 

tttcttaatt gggctgtttg ttttttgttg ttgaattgta gagttcttta tatattctgg 44700 

atcttagtcc ctcagtagac atatgatttg caaatatttt ctcctattct gtaggttgtc 44760 

tttgcacttt cttgatgact tttgatgcac aaaagttctt tatgaagtat aattaatcta 44820 

ttattttgtt gttttctgtg cctttcttgt cacatccatg aaattattgc caaattgtca 44880 

tgaagatatc ttccaacttt tttccatgaa atgtatggtt ttagctctta ggcttagtgt 44940 

cggattgatt ttgagttaat gtttgtatat ggtataaggt aagggtccaa ctctattctt 45000 

tcacatgtgg atgcccagtt attccagcat catatgttga aaaaattcct ttccccattg 45060 

aatattcttg gtgtctatgt tgaaaattga . tttactatat atgtgatata agtgagcatt 45120 

tatttctgaa tgattccatt ggtctatata tcttccctta tgccaggtac catatttttt 45180 

gggttttttt ttttgtttgt tttttgtttt ttgttttgat ggagtcttgc tctgtcaccc 45240 

aggctggagt gcagtggtga gatctcagct cactgcaaac tctgcctccc gggttcaagc 45300 

aattcttctg cctcggcctc ctaagtagct gggactatag gcgcatgcca ccatgccagg 45360 

ctaatttttg tatttttagt agagacgagg tttcaccaca ttggccaggc tcatctcgag 45420 

ctcctgacct caggtgatcc acccgcctcg gcctcccaaa gtgctaggat tacaggcatg 45480 

agccactgtg cccggccctg tattgtttta attagtgtag ctttacagta cgtttcaaag 45540 

ttggaaagta taatcactca aactgtatta ttctttttca ttattgtttt ggctattcag 45600 
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ggcctcttaa aattcaatat gaacttgagg 
gttggaattt tgatagaaat tgtattgaat 
5 ttaacaattt taaatcttct tatacgtgaa 

ccaattacat tcagcaatga tttatatttt 
agatttattc taaagtatgt aattatttta 

10 

atgttatttt catattgttt attgctgtta 
gctcttgtat cttgaacctt gttgcaatcc 
15 ttgggaggat tttctatata ggaagatcat 

cctcttttct aatttggatg aattttattt 
cttttagtac aatatttaat agcagcagtg 

20 

ttagggggaa agcttttaat ctttcaccac 
aaataaactt tacatattga ggaattttcc 
25 catgaaaggc atttcatcaa atgccttttc 

ttccaacatt atattaatgt aatgtattac 
gtattttgct tatcaatttc acttcatcat 

30 

caatttttgt atctacactt ataaaggata 
aattatctgt ttttggtatc aaggtaatgg 
35 ccttctcttc aagttttcag aagagtttga 

ttggtagaat tcactggtga agccatctga 
gattacgaat ctttttactt gttatggatc 

40 

ttttaggtaa tttgtgtgtt gcaaggaatt 
tggcatataa gtttttatag tattctttta 
45 tcagtaataa tgtcttactt ttatgtcttg 

cagtataggt aaaggtttat caattttgtt 
attgattttc tctattgttt ttcaattctc 

50 

ttatgctaga gtaattcttg aaatcagata 
aaattgtttt ggctgttcta ggtcatatga 
55 tgattgctac aaaaatcttg ctgggatttt 

ttataaacaa tttacttctt agtaatatgt 
ctgtattttc ttacatctca tttcatttct 

60 

aagccttatg catactttgt taaatgtatc 



PCT/US01/48437 



atccattttt 


ccatttataa 


gaaaaagact 


45660 


ctgtatattg 


ctttgagcag 


cactgacatc 


45720 


cacaagatgt 


cttttcattt 


atttaggtct 


45780 


ttagcataga 


ggtctttcat 


cccattggtt 


45840 


ggtggcatta 


taaatgaaat 


tgctttttaa 


45900 


tacagaaaca 


acaatgattt 


tttgtgggtt 


45960 


tgtattaggt 


ctagtagctt 


tcctgtggat 


46020 


gtcatctgga 


aatacagatg 


gccttaatct 


46080 


ctttttattt 


tctaatagct 


ctagctagaa 


46140 


aaagaaagca 


ttcttatata 


gttcctgatc 


46200 


tgagtatgat 


attagctgtg 


ggtgttttat 


46260 


atctcttcct 


agttttatga 


gaggttttct 


46320 


tactccaagt 


gagatgactg 


tgtgggggtt 


46380 


atttattgat 


tttcttattg 


taccatcctt 


46440 


atgaataatc 


cttttctaat 


attttgttgg 


46500 


ttaatctgta 


attttctttt 


cttgcagcat 


46560 


tggcctcata 


gaatgagtta 


ggcaattttc 


46620 


gaataattgt 


tttttttttt 


tctttaaatt 


46680 


tccagaagtt 


tcctttgtta 


ggaactttta 


46740 


tgtaaagatt 


ttctatttct 


tcttgaatca 


46800 


tgcccatttc 


ttatacagta 


tctgatttct 


46860 


caatacttta 


tacttttact 


ctctataagg 


46920 


ttttaattgt 


tttattgtct 


ttttgctttt 


46980 


gatcttttaa 


aagaatcaac 


atttgggttc 


47040 


tatggagatt 


tctcttctgc 


cccactaaaa 


47100 


gtataaatct 


tccaactttg 


ttatttttca 


47160 


atttccacat 


aagttttaga 


attcgtttat 


47220 


qqttqqaatt 

w w ^ w%> w *— * 


qtactqaqcc 


tatagataat 


47280 


agtcttctga 


tccatgtcca 


tgaaacatgt 


47340 


ctcagcacta 


ttttattatt 


tacaatgtat 


47400 


tgtaattatt 


tcattttttc 


ttgatattgt 


47460 
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acatggtatt tttaatttta ttttttaaat 
tttttatata ctggtctact atgacctttc 

5 

ttttaataga gtttctagaa ttttcttagt 
tttttactcc caaactttat gcctcatttt 
10 tacaatgttt aaaataagtg atgaaaatag 

aaaatgttca atattttacc cttaaatatg 
tttaccctgt tgcgcagatt ttttaaaatc 

15 

atctcatgct ttcatcttta tcgtattatc 
accctgtttt aagcccttta gaaacaatac 
20 ttgaatttag tgaatacaga tctttaatcc 

caggggattc aaaaataata aaatatgtac 
aactctacat ttacggagaa atttctctgt 

25 

atttggtctt tagtcttagc caccctgaag 
agcatagatc agagaggaaa gcacaggacc 
30 ccaagtaaac agctctacga tactggtctg 

tcaattctaa aagaaggata ataacttcat 
ttggaacatt gtctagcatc tggtaatcac 

35 

aaatcatagc acttaagagc tttaataaac 
ctttttaagt ggaagaataa attaaaattc 
40 atatagaaat ctcttccaaa gtaccattta 

aaaatatagg tgtgaaaaac taattagaag 
atgtgtagat tctcaatcta ggcctgctat 

45 

ttaatcactg agttctgatc cacttcatca 
attatctcag aaaggtgtaa ttcatggagt 
50 taacaacctc ctaaaaatta taataataaa 

tatcagatgt ggtcacttag ctgtttaaaa 
gaaagtccaa atttcctagc ctagcatacc 

55 

taccatctgc catcacacca tgctatatca 
ctgctgcttc tacctggatt ctgactcctg 
60 taattcctac tcagctgtta agatatagat 
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gttcattccc 


acatatagaa 


atgcaattta 


47520 


taaactaact 


tattagttcc 


catagctttt 


47580 


agataaccat 


tttaagtatt 


aataaagaca 


47640 


tcttgcccta 


ttacactgtg 


tagtctccaa 


47700 


caatctttcc 


ttgttcttga 


tcttagggtg 


47760 


atgttttttg 


taggtttttt 


acagagtaaa 


47820 


atgaatgggt 


attggctttt 


gtcatttttc 


47880 


tgtatattac 


acaactagag 


gagagtacaa 


47940 


taaaataagt 


tacactaaat 


aactcttcaa 


48000 


actaatggca 


cagtaatgtg 


caaagtacaa 


48060 


cttgatctca 


agaaaaggaa 


agggaaagtg 


48120 


gccagtttct 


gtgcttgaaa 


gtattatttc 


48180 


taggaagtag 


gtgacgttat 


taatacaaga 


48240 


tcagattcaa 


ctacaatttt 


taaatccttc 


48300 


gttatttaac 


atctctgaac 


ctcaattccc 


48360 


agggatgtca 


taccaatttt 


tgtgatgact 


48420 


tgagttctca 


ttaatggatg 


aggaaattta 


48480 


agagaaacta 


taattatttc 


ttccaaacct 


48540 


acatgaaaca 


aagtttgagg 


attcttagaa 


48600 


tactcacata 


aatagtcctt 


caatagggat 


48660 


aatgggaaat 


gcaatgtatt 


atttcagaga 


48720 


tagcaaacga 


ggtaagcata 


attaattcat 


48780 


aatagaacaa 


gtagtctatt 


ctgactatga 


48840 


cagaataaaa 


agcagcaggt 


tgtgctgcag 


48900 


atgagaacta 


atgggaagca 


ttgcaggttg 


48960 


aactccaatt 


gcttcctctt 


atttgtaaag 


49020 


atctaataat 


atcagtaata 


tctaactgct 


49080 


caaccttgtg 


tttttgtacc 


cgctattcca 


49140 


tcttatttca 


tcttaccgtc 


tatgcctgag 


49200 


taaatattgc 


ctcttctgta 


aagcatttcc 


49260 
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aggttgtctc 
gtaattgcat 
tgtgagcacc 
atctgggtga 
atataaaatg 
taatcaaatt 
ccaataagtt 
tatttcctat 
tcagacatga 
ttgccttcgg 
caatttgtgc 
atctttggag 
ccaggtaata 
cccaaatttt 
gaaaatatgt 
tctgctattg 
ctccaagttt 
ccatcccaag 
tcatcccatg 
ctatcccaag 
caatccctag 
ccttcccaag 
acatcccata 
caaccagaaa 
agatcagaag 
aaatcctcaa 
cattctttag 
caaagcaaag 
ctatccccaa 
ctcccagatc 
tctaaagatg 



agaattaggt 
ttgctgaatt 
ttaagggaaa 
tcaataaact 
tctcagaagc 
cctctgttca 
aatggtaaag 
ctactgtctt 
ttctgtcata 
atgttactca 
ttcaagaaga 
gctatgcttt 
aaggtagaga 
cagaggaaga 
ccattcagct 
tacaaccttc 
ttccatccca 
ctctaccagt 
tcaaacagtc 
acacatcatc 
atatgctatc 
atatgctatc 
ttgtgcagtt 
acactgaacc 
ttatggaaga 
gacggcattc 
acgtgcaagc 
gctggcaatc 
agaggcactc 
agcaagctga 
gacaagttaa 



catcttctat 
gtattactat 
atcccacatc 
attatagatt 
tttataatat 
atatttgaaa 
gcaggactgg 
aagtcagaaa 
ttaattaccc 
aaatagtgaa 
gttttccagt 
cttcaagtta 
atttgtgcca 
aattgaacct 
agactctaca 
tcaaatgcaa 
ttctgcacta 
agaaggcctg 
ttctaatctg 
tcaagatatg 
tcaagacaca 
ccaagctcta 
ccctgaaata 
tcaaaaccag 
gaccaaagaa 
cttaaaccag 
caaaggccag 
tccaaagcag 
cgtagataag 
agatcagcaa 
agaccagcag 



gttcccataa 
tatctattta 
taattgtctt 
aagcatgaat 
taagtatggg 
gactaaaagg 
cattcgtttt 
tgaaaagtaa 
tcatccttgc 
caacctgccc 
gatgattcaa 
acactctcta 
gatgaacaaa 
ttgcctccca 
tttaaacaaa 
accaagcttc 
aaactcgaag 
tcagaacaaa 
acagctaatg 
ctgtttcatg 
ccatcccacg 
tcagcgcatg 
caacacctac 
caaattttac 
tggaaatctg 
caaaccaaag 
aaatcctcaa 
aaatccttag 
caagctcaac 
gccaaagggg 
actgataagg 



agcactaagt 
tatgttgatg 
tgattcccag 
atatgaatat 
aggaattttt 
tgatgacttc 
ctgccttctc 
cttggaaaac 
ttttataggt 
cagaagaaaa 
caacaaatgc 
ggagtccttt 
agcaaagtat 
cactagagaa 
tgaaagatga 
tgcaggacca 
atatatcacc 
ccatgccatc 
acctgccccc 
acatgacatc 
ccatgccacc 
ccatattacc 
ttcagcagcc 
aaatgtcata 
aggaggaact 
ccttgcaata 
agaggcattc 
accagcaaat 
ttaatcaaac 
aacaataccc 
agcaaaactc 
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acctatttat 49320 

tacagatagt 49380 

catttagttc 49440 

tgactagatg 49500 

gaggcaatct 49560 

taaaataata 49620 

atttagatgt 49680 

aaaaaaaaaa 49740 

tctgtttttc 49800 

tgatcaatta 49860 

acaatctgtt 49920 

agtctcccaa 49980 

ccttccatct 50040 

aaagccctca 50100 

agatctacaa 50160 

agctgcgtca 50220 

tgaagacttg 50280 

taagtctaca 50340 

tcaaggcata 50400 

ccaagatatg 50460 

tcaagacata 50520 

tgaagcctca 50580 

cccagatctt 50640 

tcaagatatt 50700 

ccatagaaga 50760 

cttaaggaga 50820 

cttagatcag 50880 

caaagactgg 50940 

taaagagcaa 51000 

agaaggacaa 51060 

aaagaagcaa 51120 
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acccaggatc 


agcaaactga 


agaccagccg 


gcccaagaga 


agaaatcccc 


gaaaggacaa 


51180 


ttccaaaatg 


ttcaagccga 


aggacagcaa 


gctcaggtgg 


agaaagtgcc 


aaaactgtta 


51240 


tgccaagatt 


cagaatccca 


aatacagcaa 


taccaattct 


ggcaattcca 


caaaggcaat 


51300 


ctccaggctg 


gacaacccag 


gactgtcaat 


cttttggcca 


agaatcccct 


gactggataa 


. 51360 


ctcagggctg 


gagaaacaaa 


gattataaag 


cacgagaatg 


gcaatttgaa 


atgaagcact 


51420 


ggcaaacaca 


ggatctatta 


gagaaagaag 


ccctaaagca 


gaaagctcta 


taccaagaag 


51480 


tccaaaccca 


gcacgcaaca 


gcccaacata 


acctagaatg 


tcaagacact 


caagataaag 


51540 


accaacaaga 


ccttcaatcc 


agagttacac 


aaaaaggaga 


tatgtacact 


agagacatca 


51600 


aaccagggga 


catgaaatgt 


atagggcaaa 


cctcagggga 


cctgcaatca 


gaagacgtga 


51660 


aggcagattt 


tcattcttct 


tctggccaaa 


gctcagtaca 


agacacatgt 


ttagcctatt 


51720 


tgtccaatct 


agattcagaa 


caagatgtgc 


aaccagacac 


ttcagcttcc 


tcaaattcat 


51780 


ataaagaaga 


tgtgaattta 


acttctactt 


catgtgatcc 


aaaagatcaa 


cagcaatctg 


51840 


aagactctga 


ctaacatgca 


gaatctaccc 


aataccacac 


tgcccccatt 


aatggaatta 


51900 


aattgggaaa 


aacaatattg 


cctcctccaa 


tctgtgttct 


caactgtggt 


tgccacctca 


51960 


ttaacttaca 


aaaaaatgaa 


gggcatgctg 


agcactcaaa 


caatttgttc 


ttacttaaaa 


52020 


taaaatgaca 


acaaaccaaa 


tgttaacact 


gtatcatcac 


tttatgtatg 


tgaagaaata 


52080 


attcacatgt 


atattccttg 


tcatcaaagt 


tagtgtttca 


gtttataatc 


acgatttcat 


52140 


tttataactt 


aatgtttaat 


ggccctactt 


tggtatgatg 


ctagtatttt 


tcatctcccc 


52200 


cttttggcat 


tataggagtt 


cagaaaatca 


accacaggga 


agtctatgaa 


ggctctctaa 


52260 


gggttcaaga 


accaaaagag 


tcagcaagga 


acaaaaattt 


ctcccatttc 


cctgtcttgc 


52320 


tgtttccaat 


cctcttattt 


actctccctc 


ctcatattta 


tgttgaggat 


atatggactg 


52380 


agtttgctca 


attattaatg 


aggtagaatg 


tggcaatgaa 


tacagaatgt 


gatgatggag 


52440 


tgggagggat 


taagagatat 


ctagagacag 


gaaagtagaa 


attagggaag 


aacagagatg 


52500 


cagatgtcat 


aatagaaaat 


atgacatact 


gtattgtatt 


cagtgaagct 


tcagctccct 


52560 


tctttatgcc 


tgcatttact 


gattataatt 


tctcaaccct 


tatgatacat 


aatttcatcc 


52620 


tctgtcatca 


aaggaaattg 


tgttatcaaa 


agctatcagt 


tacctcaatc 


ttccaagcct 


52680 


aatcgtgttt 


tttcaatcct 


ctttcccttt 


atctctgata 


catttggaaa 


ggttaacaat 


52740 


cccttttctt 


tcaattttta 


tcacccccat 


taacttctgt 


gttttgcact 


tcctgtttca 


52800 


aaaattctat 


aataggtatt 


tctagggatc 


tctccccaac 


tttattcatt 


ttttgtaatg 


52860 


acaattcact 


attacagttc 


tctcaccttc 


ttgtgaataa 


tccagactgt 


gtcatatttg 


52920 
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tgtcttctac 
ttaagggcat 
agtagtataa 
ctttgtagta 
ttacctgagt 
ataatactaa 
tgtagagttt 
tatatatata 
tatagagaga 
ttgtatcaaa 
tctacccaac 
tagtgccaag 
actgagaaac 
ctatgctaca 
ttagacttac 
tcccactgag 
gtaacctttc 
ttgacctccc 
gacaaaatca 
aagacaaaat 
taattttttt 
aggcttcttt 
agttcccaag 
ctagaatcaa 
agatggtaac 
ctacaatatt 
cctgagtctc 
gaatattcaa 
agctgtttgg 
caaatcttgc 
ctgcataaca 



agaatttcag 
aaatgggagc 
taagatacag 
aaatattagc 
tagcctttag 
taatttctac 
aactttcagt 
tatatatgta 
gagagagaga 
ttctttgaga 
atacacaaat 
tcactggaga 
ttataggcca 
tttggaacat 
aatatgagaa 
ggagatgcta 
tagtctacag 
ctttctgaga 
aatgtgttta 
gcaaaaggct 
attaaaaact 
gtagtaaaat 
aactgtggaa 
caacctacag 
tattgccaat 
tttttccctc 
tagagaatgg 
ctggattgtg 
ctagaaggta 
tcaaaatggc 
aattaccaca 



cacaaaggct 
ttggagtctc 
agctccttat 
ttcctgagct 
ttcctgaatg 
cttttccttt 
ctacaaactt 
tgtatatata 
gtgtatatat 
atccagccca 
acatttctgc 
ggcaatttcc 
agttcaaata 
ggaaattttt 
tggaagaatg 
aggggcaagt 
aagactctct 
cacctaatac 
agacccataa 
gtgggaaact 
gtcttattac 
tgggaaaagt 
aaaaaaataa 
ctaaattata 
ggaatgtaaa 
atgcactggc 
agcctggtcc 
atctaaaaga 
gtctaagctg 
aggactccta 
aaggtagctg 



tgattcaaga 
tacttagagg 
gactggctac 
tcacatgatt 
accaccacct 
agctagatta 

gggtgtttag 

tatatatata 
atatatatat 
cacagttact 
agaaatgtac 
atcatcagcg 
taccgtatgt 
ccaaaattgc 
taaaaaacca 
cagaagctcc 
tttctctctg 
tgtcctatca 
ggaaaaataa 
agaaaattta 
aaatataata 
tgatttgaga 
gctggctgct 
cttcccagcc 
gtttgactcc 
tagatgcaga 
ataaatcgtg 
aaaacccata 
agtaatagag 
ctcgtatctc 
cttaaaacaa 



atacattcaa 
tgaacagaga 
atgttccttg 
gcttgacctt 
ggtattttaa 
gcacgtttct 
gaagtatata 
tatgtatgta 
atacacacac 
cttgccagtt 
aaagatcctg 
actagtccac 
gtcctgtgaa 
aagaatacaa 
cttcttctga 
caaggccccc 
tttcttgaag 
atgagtttgg 
tgtttgaagt 
tgatttgctt 
tgtgtttcct 
tactgctacc 
tacaattgca 
accttgcatc 
taaaatcacc 
gaatctagga 
gtgtggatta 
ttagggcaag 
tctttaaggc 
cttgtgttag 
caagcttatt 



taaaacattt 
agttatatgt 
ctgcctgtgc 
gatagagtcc 
cttactctga 
acagtgctta 
tatatatata 
tatatatata 
acacacaatt 
catccctata 
gaatcttaaa 
accagttacc 
cttgttgctt 
cgcctatagg 
ggcctttcca 
agaacccata 
gcattttccc 
gtctgaagtt 
atctgtttaa 
aaaggcaacc 
agaatatgtc 
ttacacactc 
ttttcattta 
taggtaggtg 
aacattatat 
tccacaagac 
cttccagttt 
gaatgcatgg 
acaaaaaagt 
ttacctattg 
atctctcagt 



52980 
53040 
53100 
53160 
53220 
53280 
53340 
53400 
53460 
53520 
53580 
53640 
53700 
53760 
53820 
53880 
53940 
54000 
54060 
54120 
54180 
54240 
54300 
54360 
54420 
54480 
54540 
54600 
54660 
54720 
54780 
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tcagaagccc aggaaagctt ggttaaatta 
gaggactggc cagggcagtg gttctggggt 

5 

ggaagaatcc agtcctcacg gctgcaggac 
ctgggaactg ctcctgcatc ccttctcatg 
10 tgtgtcaagt cttctcaaac ccaaatctga 

ccttttatta cactagactt actggaataa 
ttaataacct taaatttctg tgcaaagttc 

15 

aaagattatg ggctcccata aatagtcaca 
tgttttccta atttcttaga aacatgtttg 
20 gttttaaaga gattactgtg gctgcagttt 

gcagggagtc aattaggaaa ctcttaaaat 
gatgtataga agagtagaga ctaaaatgat 

25 

agaaactaga ggtgtagaat tgaagaaaat 
gattagctgg ctgagacatc attaccatca 
30 caagagtaag gggcatagtc agttctttct 

gaaacagaac caagaggatt agacagatag 
tagatatata gaagatagat agatagatag 

35 

gatagataga tagatagata gatagataga 
ataaggctga gaagtcccaa gatctgaaag 
40 tccagcctga gtacgaaggc ctcagaacca 

tgaaggtagg aaaagaccag tgtcccagct 
ttgctcaggc tttggtttta tttaagccct 

45 

ttaccaccgt tcatggaata agatttaaag 
aaagaaaact agaggatctg gagactccta 
50 tgctctagct ctgctcccag agttgttctc 

gaaggcttta acagcagaga aaaataagag 
gctgcagtaa cagaagaatg agagattaaa 

55 

gctttattca gtttaccaat gtaaaaacct 
gaataaagtt taaccaaaag tcttggcacc 
60 accatcaaaa ggccatcctt tgtgaacctg 



tttgtacaga 


atatcacatg 


gctaaaatca 


54840 


tgagctcctc 


ttcaaagttc 


atccattgtt 


54900 


tgaggttccc 


atttccttgc 


tggctgtcaa 


54960 


tgatcccttc 


catcttcaac 


caggtaacag 


55020 


ctccctctcc 


tgcatctggt 


tttaaggagt 


55080 


tccaagataa 


cctattttag 


gataaattga 


55140 


cttttctcat 


tgaagataac 


cacaattagg 


55200 


tggaaaaata 


cacacaattt 


taaaaattaa 


55260 


acttttaaag 


caggaactaa 


ctgaattggt 


55320 


ggaggatgaa 


ctggagaaag 


aaagatggca 


55380 


agtgaagaaa 


gaggtgctaa 


gtgatgtagg 


55440 


aattcaagag 


taaaattgtc 


aggatgggac 


55500 


gaagaataaa 


ccatctgctc 


tagcttaaaa 


55560 


acaaaacaga 


cgaagaggca 


caggttgcag 


55620 


agtgtagtgt 


attaggcagg 


gttctccaga 


55680 


acagagagct 


atagatagat 


atatagatga 


55740 


atgatagata 


gatagataga 


tagattgata 


55800 


tagatagata 


aggaattggc 


tcacatgatt 


55860 


ctggagaccc 


aggagagctg 


atggtatagt 


55920 


agagtttaaa 


ttccagttca 


agtcttagtc 


55980 


caaagacagg 


cagagaaagc 


aaattctctc 


56040 


cattggagac 


attgagagta 


acactgggtt 


56100 


acactattgt 


tatttataat 


taaaaaaaaa 


56160 


agcaggtgac 


atcttgccac 


aacagttaac 


56220 


tccagagtat 


ggatatggat 


ctttttaaag 


56280 


aaacttagaa 


acaaagaaaa 


ttgactttca 


56340 


tgaggcccac 


ccatattggg 


gagggcatct 


56400 


catccagact 


catcctcata 


gacacactaa 


56460 


tattgttcaa 


tcaagatgat 


gcataattta 


56520 


acactcatat 


gcatctcctt 


aagccatact 


56580 
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taatctccaa 


ataaagacta 


aaaggtgata 


attcttcata 


atatgttaca 


actatcatgt 


56640 




atacaactgg 


aaatgtacca 


acctcttctc 


agaagaaggg 


* 

gtaaagtctg 


tgaatgatgt 


56700 


5 


ttactcttct 


ccttactgtc 


tagcaacttc 


tatgatttaa 


attgtatgat 


gtaaagttaa 


56760 




caatactata 


atataaacat 


gccttatgtt 


acatgattag 


agaatgagaa 


agaacagatt 


56820 


10 


gtgtgtgtgt 


gtgtgtgtgt 


gtgtagtttc 


tataactgga 


tatgtggacg 


taactcatat 


56880 


ttataactac 


cttattctac 


tacccattct 


gtagtccctt 


tgccttcagc 


aaacacctta 


56940 




gctggccatg 


gttctttacc 


tggtagaatg 


acccaaacct 


tcattcctga 


aaagcctgga 


57000 


15 


caattagtgg 


tcctgcctgg 


actgagttgt 


tgtggttttc 


ttaattgacc 


ttaatcacag 


57060 




atcatggtaa 


tactaagaga 


tgeatgaage 


gatctcctat 


attccaagca 


tagtgttccc 


57120 


20 


ttatgtccat 


tgtggaatag 


tggtctaatt 


tccccttagt 


catcagaaac 


aatcacccca 


57180 




tccagcacag 


taaccccttt 


cagttatctg 


aagaggatgg 


aagggctttg 


ctccaaaatt 


57240 




cagcaataaa 


cctataggag 


cctaccttcc 


cagaagctct 


atatagcatc 


cctatctgcc 


57300 


25 


attgacactt 


caagcaccat 


tggatctgtt 


gatggtatgg 


tccaagcaga 


agtcagctta 


57360 




cacggaagcc 


gaggcctgtt 


gtgaagcttt 


tcttctggtc 


ctactcaaaa 


gcagctctgt 


57420 


30 


aagtcacttg 


gcaaattggc 


cagagcaaca 


cacccaagtg 


aggaatatgt 


tgcctccaaa 


57480 




atccaaagaa 


actcattaga 


tgctgtgcct 


cctattcatt 


gtggaagcca 


gatgegacaa 


57540 




cttatcctca 


tgctggaaga 


gattctcaat 


attccccaca 


ccacttgatc 


cctagaaatt 


57600 


35 


tcactgaggt 


acaagaccct 


gaggttttat 


cagatttatt 


ttccacactc 


tggcatgtaa 


57660 




atatcttatg 


aataaatcta 


gagtagttac 


tacgtcttat 


tegctaggtc 


caatcagcat 


57720 


40 


aatgtcatca 


acctaatgga 


ccagtgtgat 


atcttgtgga 


aggaaagggc 


aattaagatc 


57780 




cctgtgaact 


aaattatgac 


atagagctat 


aaagttgata 


tacccccgag 


gtatgacagt 


57840 




gaatgtgtat 


tgctggcctt 


gtcatctgaa 


agcaaactgt 


tcctggtggt 


ctttactaat 


57900 


45 


agctgtccag 


aaaaagcatt 


tatcagatca 


ataacattat 


actgggtact 


agaggatgtg 


57960 




ttaatttgct 


caagtaatga 


aaccacattt 


ggtaaggcag 


ttgeaattag 


agtcacctcc 


58020 


50 


aattggtaat 


ccactgtcat 


cttccaagat 


ccatctgtct 


tetgeacatg 


ccaaacatac 


58080 




aagttgaata 


ggaatgtggt 


gggaatcacc 


acccctgcaa 


ctttcaagtt 


actgatggtg 


58140 




tcattaatct 


ctgtagcccc 


tccaggaatg 


tgttgggagt 


caccaaccct 


gcatctttta 


58200 


55 


acrt tact era t 


qqat t tatt a 


atttctqeaa 


taccatcaQQ 


aatqtqqtqa 


tget tttget 


58260 




ttactatttt 


cctaggcaga 


ggcagtacta 


gaacttccac 


ttggtattcc 


ccaccataaa 


58320 


60 


» 

ggtccttact 


ccacagagta 


agaaaataat 


atgagatcct 


accacctatt 


gagtatattt 


58380 




gttttaatta 


tgcattctgg 


aattggggga 


ataacegcag 


gatgggtttg 


ggagcccact 


58440 
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gcacccattg 


tgagattaac 


cagagataaa 


actccatgga 


tcatcggacc 


tccacaacca 


5 


cttactctga 


gtggtaagcc 


acagtgatgc 


tttgagtctc 


ctggaattag 


tatcaattca 


gaatcagtgt 


ctagtagtcc 


ctgagaggtc 


tggttatttc 


cttttcccca 


atgctcagta 


* 


ccctggaaaa 


aggctgcagg 


tccctttgag 


gaaggctgtg 


agaaagatca 


acagtacaaa 


10 


ttcttggtaa 


tgcaccaggt 


ccttccttaa 


ggagacctgg 


tctccccttt 


cattcaagga 




gttctttctg 


ggtttgtaaa 


ttttattgca 


ggtctgtaaa 


ctggctcagg 


tctgggaatg 


15 


gattgagggg 


ctatgagtct 


atatttttat 


gattcaggtt 


aggctttcat 


tcaatgacct 


agaacttttc 


tgcttatata 


gatcaagggg 


taatttaaaa 


ggatttctgt 


ctattttacc 




l- v_, ciy y cicn_ a. 


w a ^— ■ a. cl i_ aa 


citLeicscicic aca 


cccacaccat 


aaat c tcrat t 


caaatat t aa 


20 


tcttatccat 


acccaccctc 


acagacacag 


ccaaaataat 


gttttaatta 


agtatttgag 




cagtctatga 


cctggtcaag 


ttgacacata 


aagttaacca 


tcgtattttg 


tttatactca 


25 


cagtcttggg 


gccacaatct 


ggaatgagtc 


ttagtcgaaa 


tcacctgcac 


agatcagagg 




agctgctggg 


atgcaaatgt 


atactaagca 


agtactcatt 


gagatatagc 


attagcagga 




cx aaaa a i— • y 


attattttaa 


aarataaatc 


caaaaatctt 


ctctacraaaa 


t aataatcta 


30 


tatttatgca 


acctctcact 


tgtggttatc 


atgcctagaa 


aggcatgcag 


gcccatcaca 




tttctgaaag 


gctgttactg 


ttagtgaacc 


acacagacac 


aaagagtcct 


gaggaaagtg 


35 


cagatctcct 


ttgcagccat 


aatatcctcc 


atcagttcac 


aaggtactct 


ttggacattt 




ggaaattctc 


atctctcact 


ctgcttcccc 


cactaacaca 


tacacacata 


cacctttctt 




' — i — - ^> — ^ — - i— y L.y y i— 


\^ *w V — y l_ y u. V_- V— ■ 


tttttttttt 


tttttacttc 


tctaacctac 


c c t acrcra 1 1 c 


40 


aaatgtggag 


tttctggtca 


gagagttaaa 


gttttatgca 


tcactggaga 


aagctttagt 




tgtcaaggca 


aagctctgaa 


gcatcaaatt 


gagtacatgg 


aatctaaagg 


taaagcaact 


45 


ggatcataat 


tttgacccag 


caccattact 


agtgggatac 


tttgggaaag 


tttcttaaaa 




tttccgtgcc 


taagttgagt 


agtactcagc 


acagggttgt 


caagaagatt 


ggataattaa 




ay act uy ua La 


y y i— L-cicicxci 


fa trfafrar 

cl <— ^ ^ y 1 — c*. u 


CICTP Cf t" 3 CI a t" CI 

yycty ^y «. uy 


rf raatttta 


ttccataata 


50 


aaacatctat 


aaagctaatt 


cttttactag 


tcggtattac 


ttcttatggc 


ataaacttga 




tctaatttat 


gccaccaaag 


tatctctctt 


tattctgaat 


tatacaaatt 


atctccctgc 


55 


tattgagtga 


actgcatccc 


actctaaatt 


catttgttga 


agttctagct 


cacagtgtga 




tatctggaga 


cgggcttttg 


aagctgttgg 


gtttagatga 


ggatatgaga 


gtgaatccct 




catgacggga 


ttagtgtcct 


tatatgaaaa 


gacaccaggc 


ctctgtctcc 


tgctcttttt 


60 


ctgtgtgtgt 


gtccttgtgt 


gtgtgctcac 


atgcctgtgt 


acacacaaag 


gaaagatgat 



58500 
58560 
58620 
58680 
58740 
58800 
58860 
58920 
58980 
59040 
59100 
59160 
59220 
59280 
59340 
59400 
59460 
59520 
59580 
59640 
59700 
59760 
59820 
59880 
59940 
60000 
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gtgaggacag aaataataca gtcatctgca 
agttagctga cactttgatc ttggaattcg 
5 tgctgtttaa atgaagcagt catggtaatt 

accttttctt cttgcaaatt gggaaaggga 
atgaaacaaa gaaacttttt gttctttaaa 

10 

tatctaatac ctatgtgtaa gtccgaaatt 
agtaggaaat ccctgttgct gcctttgtag 
15 ttaaagtatg ctatatatgc aagactgtgc 

aatgaaaacc agcaggacag tgcagtgaga 
atgcagagaa aactaagaat tcaatagatg 

20 

cgatagaagc cccagtggtt ccccttccct 
tggagacatt aagagtgaca ctgggtttta 
25 ctattattat ttataaactt taaaaacaaa 

cgggtgacat ctggccacaa cagttgactg 
caaagtatgg atatggatct ttttaaagga 

30 

acttaaaaac aaagaaaatt gactttcagc 
cagtgttttt cttggaactt tccagcaagg 
35 taatcccatt gactatggtt cagagaactt 

aatacttagc aataagaagc ttgaaattaa 
acatattata tattattcca taaaccacat 

40 

aggttttggc agattggaaa ataagaagaa 
tgaacagata catgtcatca atgtataaag 
45 tggagttcca cagccaccct gaaatggaga 

ggcacggtgg ctcatgcctg taatcccagc 
aggtcaggag atcgagacca atctggctaa 

50 

caaaaaaatt agccaggcat ggtggcgggc 
ggcaggggaa tggcgcaaac ccaggaggca 
55 gcactccagc ctgggcaaca gagcaagact 

tgcactgatt cagtgtgaca tttccagtat 
ctataaagaa tcaacaaaaa atgaactttg 

60 

tatctgctca ccatgctaaa aatggcaata 
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agccaagaag 


agagctctca 


taaggaacca 


60300 


cagcctccaa 


aagtatgaga 


aataaatttg 


60360 


taccatagca 


ccctaagcag 


aataatatac 


60420 


gtgaaggcat 


aaaaatccca 


tctgaccaca 


60480 


cataaaattc 


ctaggtagtc 


acaaagggaa 


60540 


agtaattaga 


gtgtttatgt 


cggaagatac 


60600 


gttctgtaga 


atggtcttct 


aatcttattt 


60660 


tactactacc 


ttagaggagt 


gaagaatcag 


60720 


aaaacagtca 


tcacttacag 


aagcccagac 


60780 


cagaaaatca 


gagcttgaga 


ccactccaca 


60840 


cgtattcgct 


tgctccttcc 


aagaaatgtc 


60900 


ccattattca 


tggaataaga 


tttaaagaca 


60960 


agaaaactag 


aggatttagg 


gactcctaag 


61020 


ctctaattct 


gctcccagag 


ttgttctctc 


61080 


gggctttaac 


aacagagaaa 


aataagagaa 


61140 


tgcagtaaga 


gaagaacgat 


agaaagaata 


61200 


gaatcagacg 


agaggctgtg 


aggaaaggcc 


61260 


ggattcctca 


gatcatcata 


acttctctgg 


61320 


ggctgaattt 


cagaacatta 


atgaatacaa 


61380 


taagactaga 


gtaatttgca 


ttaaaagagt 


61440 


atatgaggat 


atggaaaatt 


tttgaagtag 


61500 


gggcctaggg 


catttattag 


caggaggcca 


61560 


aattgtaaaa 


taacgtctgt 


actggggccg 


61620 


actttgggag 


gctgaggcag 


gcagatcaca 


61680 


cacggtgaaa 


ccctgtctct 


actaaaaata 


61740 


acctgtaatc 


ccagctactc 


aggaggctga 


61800 


gaactttcag 


tgagccgaga 


tcgcaccact 


61860 


ccatc t caaa 


aaataaaat a 


aaataatcrtc 


61920 


tatagctaag 


ctcatagaaa 


cacatttaac 


61980 


gaaagagtgc 


aaacataaag 


caagacaaag 


62040 


ataagagaaa 


taatacttat 


tactcttatt 


62100 
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tttattacta 


ctacaattta 


cttagcatct 


actacactcc 


aggtacttta 


aaatttactc 


62160 


5 


acaataactg 


tataacaatt 


tttattctca 


taaaggaaat 


gtaatatttt 


aataaatatg 


62220 


taaattaata 


gataacttag 


cctagacccc 


ataaatagta 


aataaaaggg 


cttggggttc 


62280 




attctaggtt 


tctcttctcc 


aaagagtaag 


tcttccactg 


catcaccaga 


ttgcatgaat 


62340 


10 


taaagaataa 


taaaaaacaa 


agaacaataa 


aaaattaaag 


ttatatttct 


gccatttgtc 


62400 




agatagctat 


tataacactg 


tgctagtcat 


catgcttaga 


gggagaaaaa 


acaaaaactg 


62460 


15 


gggaaggtat 


agagcaaagt 


taaaacctga 


gtaattaggg 


tgaaatggtg 


ttttataaaa 


62520 


gtatgttgca 


acatagtgtg 


cttaacttag 


ggaatgctaa 


cacaacatca 


gatgatgttc 


62580 




atoaaQQata 


atcaccatta 


tcrattcaacQ 


tccatagaaa 


actgaacaag 


agatggatac 


62640 


20 


gttcagagaa 


aagaaattgt 


gagggagggg 


tggcctttta 


aaatattcta 


actggcaact 


62700 




ggtaagtggc 


tagaggctgt 


cacatcatga 


ggttcaatta 


aggaagacct 


gatctcaata 


62760 


25 


ggaaacacca 


gaaaatgaca 


aggtacagtt 


aaaaacagac 


acaaattgtt 


cccttccctt 


62820 




tctttcttgt 


cctttcaaga 


tgaaagtaca 


gatcctccca 


tgaagacatt 


agggggccta 


62880 




ttttcttacc 


cctcoaatct 


QQQt tcrt c t a 


taaaatttaa 


ttggccaata 


tgacattaac 


62940 


30 


acacatgact 


cagcagaggt 


ttgaaaagta 


tttgtgaatt 


ggaagagaac 


caagagtctg 


63000 




ccaactgcca 


gagacatgac 


tgaggccatc 


ctaattcatg 


taattggaca 


ctccaccagg 


63060 


35 


tgaccacagt 


tgcatgagac 


agcctggtag 


agaagtgctg 


agcagcctta 


gatgagacag 


63120 




accacacaca 


ggacccacag 


atgtgtaaac 


aaaagccgat 


ggtggttgct 


ggaactaact 


63180 




acat t t tciaa 


ooatttQt ta 


cacaqctaaa 


qctacttaaa 


t cacratQcrca 


aaaccacaaa 


63240 


40 


gtgacaggaa 


aaatggatgt 


tatctatact 


tcaagaagtg 


gttaatttgt 


tccgtggcct 


63300 




gagagatttt 


gcgttctacc 


tgtgtctgaa 


gatatggaac 


tggaataaat 


gtgtatctag 


63360 


45 


ttgccatttg 


gtaactaaaa 


tgtccttttt 


tttgagacag 


agtcttgctc 


tgttgcccag 


63420 




gctggagtgc 


agtggtgcaa 


tttcagctca 


aggaaacctc 


cggctcctgg 


gttcaagtga 


63480 




t tc tcatcjcc 


tcaocc tccc 


aaatacfctQQ 


qactacaqoc 


acotcrccacc 


acacccqqct 


63540 


50 


aattttttat 


atttttagta 


gagatggggt 


ttcaccatgt 


tggccaggct 


ggtctctaac 


63600 




tcctgacctc 


aggtgatccg 


cccgcctcag 


cctcccaaag 


tgctgggatt 


acaggcgcga 


63660 


55 


gccaccatgc 


gcggtcccta 


aaatgtcttt 


atatgtagat 


catctagtcc 


aatttggtca 


63720 




atgtatatat 


gaggaatcag 


aaaccaggag 


agggggatgt 


gacttgccaa 


aagtcacagt 


63780 




aaactcctaa 


ctaaaccagg 


actacttgtc 


aggttctctg 


atccccagtc 


ccaggcattt 


63840 


60 


tacttatagc 


catcttggga 


tcttgagcag 


tcctgagatt 


ctatgactct 


gtgactctat 


63900 
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ggttctataa ttccaagaca aagcaaacaa 
aaaagaacat ggtctagact gaagtaccaa 
5 caccatcatg gattcaagca ccgcacacag 

cactgcttca gaatatgagt ccacagaact 
cttgcaaaaa ttatttgcta gaaaaatgaa 

10 

tgtatatttt actgaggcag gggaaaggct 
ggtaggagcc tattccaatt cttctttaag 
1 5 tttgagagag gatgtttgag aagaacgaat 

atcatttagg actcacctac ttaaaaagtg 
cagactgaga aggtggactg agaagaagag 

20 

atattgtata ccttccattt ttgtatattg 
accactgctt ggtgtcacag tgcatcacat 
25 aaaaacccga agagatggag agtcctctgg 

tcagagaatt ggccatctcc aggcagggct 
ctggtatact catttagaat tcagtgtttc 

30 

tgaagaactt atgaagacac aggtttccca 
ggatagggcc tgatagttcg catttctagc 
35 ggaccacatt ttcaggagca ctgatttaga 

aggaattaag aaagtaccca ggagcatcat 
accaaagaga gtaattgcca aatccttttg 

40 

aactatcact ttgcttttcc tcaccatgac 
ctgtttggaa ttatgacctt ttcttttgga 
45 tatccaaggt ttccctttat atttctttca 

agtatagtca atcaagttca atttgaagcc 
caatgaaata agctagatcc agttgtatga 

50 

tgttgaaatt atttccccat tagttcatca 
aagttccttt gggaatactt cttttgaggc 
55 ttgaaattag ttactattta ttataagcca 

cccaaagacc tctcagtcta gcaagaaaac 
taaatgttat cacagaggtg caggaaatgc 

60 

ttctttacag aagtgaagtg ggaagggttt 
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ttccagtgct 


ccaggcagcc 


tcagcacaag 


63960 


ctaaatcatc 


tcctttcaaa 


ttatcaccga 


64020 


tccggtgttt 


ctggtatttc 


ctccagaaat 


64080 


ttcagccacg 


accttttcaa 


ctcaaagccc 


64140 


aatcttaggg 


gtaagtaaga 


cttgccccta 


64200 


agggaaatta 


tctgttggca 


catgtttgaa 


64260 


acaatgtgaa 


tatagagtgt 


tggtgggcca 


64320 


catgtgggtc 


tggacttgac 


aaaaaatgct 


64380 


atagttgcta 


tggcagtaaa 


tgagattact 


64440 


agcagagcac 


atagactcac 


acgtttttgt 


64500 


tcctcattag 


tctttcccag 


cacttgtcaa 


64560 


gagtggctca 


gagagggaga 


gaaaaaaaaa 


64620 


gcctgagagt 


gtgacaatac 


ctcaagaggc 


64680 


cctcacagtc 


caattctgct 


tgaggccagc 


64740 


tcagacattc 


acatttatgg 


gctaaagaac 


64800 


gcctcacctg 


gagaccgttc 


agtaagtatg 


64860 


tcccaagtga 


tcctgatgtt 


gccaatccat 


64920 


tcattttacc 


tatcaccaaa 


taccagatag 


64980 


aaagggactt 


ctattattct 


tcccctaatt 


65 04 0 


accaaaagaa 


gatattctaa 


acagaaggga 


65100 


tttttcctct 


atttgcagac 


tatccagatc 


65160 


gttatcttcc 


ttttcacttt 


gttaaaacca 


65220 


ggatatccat 


tctggggctc 


tgttttggtg 


65280 


atgccaacca 


ggatgttagg 


gaattcttca 


65340 


catgctttca 


aaagtgcaaa 


aaactaacct 


65400 


cagtcactga 


tattccatca 


ttaggtcttg 


65460 


ttgcagttgt 


ttatttggct 


attcattcat 


65520 


Qatacacacra 


qaaaocfcaat 


qqqcctctcc 


65580 


atgtaaacag 


gtcacttcaa 


gtcagtgtga 


65640 


ttatgtaagc 


agagatgaag 


gagcaattta 


65700 


ctcagatata 


ccttttgagc 


tgcggagagg 


65760 



-253- 



WO 02/062946 

tatttattaa acccagtgta gtcacaatca 
ccagagaatg ggcagagcaa tggcttcagg 

5 

tcagttgagt gaggagtcag acaattaaca 
ggctgcctcc ataggaaagg catcaacaaa 
10 agcttgacca agttttaaag ggtgaatggt 

gggagttggg atacccaagt gtaaggatta 
tatgagcaca ggctgaaggt atgggagagc 

15 

gcacagggag tgaggatgga gagatgaggc 
taaacatgga agtgctgtaa tcagatttga 
20 gtgacccact tccaagaaaa atcttggaga 

ccccacctta tgttcccatc caagagaaca 
ttaatgatta taaactgact tgaaatccct 

25 

agctataaaa agaacatgga tgagtaaacc 
tgttcaaata aatagttagg gctaaaggag 
30 catttgtgga aagtcatata aagcaactca 

cttagaaggc gggcctgtaa aagaaatgca 
agctaaacat ggtcatcatt aacatattta 

35 

cttcctaatt gcagtgaaaa gaaaaaccac 
attaaaaata tattttgtag ccagtcatgg 
40 gaggccaaga cgggtggatc atgaggtcag 

aaaccccatc tctactaaaa atacaaaatt 
ccagctactc aggaggctga ggcaggagaa 

45 

ggagccgaga tcgtgcctgg gcaacggaaa 
aaagtaatga gttacatttt cactatttcc 
50 atttatcctg gccttgaaaa aaatttgtaa 

atcaagtatt aataccaata aacagcattt 
tatgctagag gagaaagttt ctctaggtca 

55 

acactgaggc ttacaatcgt taaatagctt 
ttatttcaac attccagtta cttattcatt 
60 ccaggcactt tgacaggcac tactttgact 



PCT/US01/48437 



ataaacatta 


acttaggacc 


taatttagtg 


65820 


gaaaagaaag 


cagtgcctga 


gctcaaggtc 


65880 


cagggtgtga 


tgagggagaa 


taaacacagg 


65940 


acagagggga 


caacgcccag 


agacatttcc 


66000 


taatcaggtg 


aggacacaca 


cacagaagat 


66060 


gggatgtctt 


gttgcatgcc 


aagggagaaa 


66120 


attttctatt 


atgtgaacta 


taagagttca 


66180 


tggaaaatac 


ccatgaaggg 


gctggaaatt 


66240 


aatttccact 


agacaacaat 


gctaccacgt 


66300 


ttttaagaac 


taccagcaca 


ttcactgcaa 


66360 


cactggattg 


cagtaagatg 


gaatccaatc 


66420 


ttgtcatgag 


atcaaagctg 


agtccacaat 


66480 


aaagccctgt 


acggttaaaa 


aaaatgttca 


66540 


atttccagac 


cagtgttttg 


tccactattc 


66600 


cccttctcac 


tgccctgagg 


agtgtgtgaa 


66660 


tagaggtcta 


ttctgtaaga 


actcaacatc 


66720 


ttcctcttaa 


cagttcatta 


attctggagc 


66780 


agaaactctg 


gtgagttata 


ttcttacttt 


66840 


tggctcatac 


ctgtaatccc 


agcactttgg 


66900 


gagttcaaga 


ccagcctggc 


caacatggtg 


66960 


agccaggtgt 


ggtggcacac 


acctgtagtc 


67020 


ttgcttgtac 


ctgggaggtg 


gaggttgcaa 


67080 


agactccatc 


tcaaaaaaaa 


aaaacaaaaa 


67140 


actttattaa 


aaatatataa 


ctttcaaata 


67200 


aatcattagg 


tcaccataat 


tccacaaact 


67260 


gcttcctgtt 


ttcctagaat 


tataacccat 


67320 


tctagtttag 


ctcttatttt 


atgtatgaag 


67380 


gtccaaattc 


ccagccaatt 


aatggtagaa 


67440 


cttttttgtt 


ttctgggtat 


ttacggcata 


67500 


catacgaaag 


gcaaactgta 


gacaatctaa 


67560 



-254- 



WO 02/062946 

ttaaaccata aacagaatct caaaataaag 
ataaattata agtagaatat ttttacttta 
5 atgccaacaa aagattttag caaaggactg 

attaaaaatg gcagagaagt gcagcattca 
taaacagacc atgactttca ccagaaaatc 

10 

tcccaactga ataagagtta acttaggcat 
gttcacaaga tccccttatt gtgtagatga 
15 cgcacaggat cacagagctg cctcgtggca 

cattctccgc actcattgct ttgtttgtag 
ttcttattct atttcagata atattgagcc 

20 

caatagctgg aatcattctc ctcacatttg 
gttattctca ccaaaatagt cagtgtaagg 
25 tgcatttata gagtgatgag taaagggctt 

caccattcag atcatgttgt ggacacatgt 
tccataatat ttgatagaca ttacaccagg 

30 

aactcccacc tttaggaatt cacaggagca 
atacaaatat aaatctaaaa aaaaaaggta 
35 atgtacatta gaatcacagt ggtgtgtttt 

aacttccgat aaagcatgat aagattgaaa 
accactaaga ttagtgtaaa agaatgtaaa 

40 

gttaccaaac acacacaaaa attgctgagg 
agggggaagt catattagaa gatatttctt 
45 gaatgcacca ggtccacaga aagcaggact 

gtaatggtta agtatggcta tgtggattaa 
aaaaaacaaa acaaccttaa agacattaaa 

50 

caatttttag atgaaagcct gcaacctgaa 
agctacttat gccttcagac ccatttgcca 
55 ccatgtggac tgaaagagaa agaagtcaag 

aagagagatt ttctccgtat taaactaggt 
gggcaaaaag atatacaacg taccctgccc 

60 

aaaaggaaaa gtatttgaga agtgtggcca 
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aatacagtgc 


tactgaatgt 


aaagttatag 


67620 


ctcaaagaat 


tgaagacata 


agatattact 


67680 


aaaatcctta 


gtttgtcatt 


ttagggagaa 


67740 


gttaaactta 


ggagtcttaa 


aataaaaatg 


67800 


aaaatgcaat 


tgttacaggc 


acacccttcc 


67860 


agaatatggg 


gaatggaaag 


gttactaaaa 


67920 


aggcctgaag 


cccaaagatg 


tgaagcaatt 


67980 


ttaagatgct 


gtgactaaaa 


tcctgctctc 


68040 


tacaggctgg 


acctcagtca 


cattgttatg 


68100 


gaataatgaa 


ttttcttagt 


gccctgggag 


68160 


gtttcatcct 


agatcaaaac 


tacatttgtg 


68220 


ctgttactgt 


cctgttcttg 


gtaagtatgt 


68280 


aatagaaaat 


atttattagc 


actcaatcag 


68340 


attttctttt 


actgacaaac 


atttatgaaa 


68400 


cattgtggaa 


gagaaaaata 


aacacattat 


68460 


caagagataa 


aacatacaca 


aatcagtatg 


68520 


cagcagtatg 


gctgtagttt 


atagccctat 


68580 


tacagttctt 


gatttggttt 


taacaggctg 


68640 


aggggcattt 


attttagctg 


ccttcaaaat 


68700 


aggtttgatc 


cacatggaga 


gaaaaagaga 


68760 


aaattgaaaa 


cccaccatct 


aaggcaaagc 


68820 


acagcagagc 


ctaagaaata 


cttaatattt 


68880 


aaagagcaga 


aaatagggag 


actgattttg 


68940 


ctcttctctg 


aaaacagaag 


tctttttatt 


69000 


aagctgaaag 


gtcagctgga 


actactagat 


69060 


atttacaagt 


ggaactctga 


agagcatgaa 


69120 


gtgatgctta 


tttggaatcc 


aatccctggc 


69180 


crtttcacatc 


cac t caaaot 


ataaaaactt 


69240 


ctttgatagg 


ctactacgct 


cttcaggtaa 


69300 


caaaatagca 


aagtcgctga 


acaaagttta 


69360 


caagccaaac 


aaaccccttg 


acaacctcct 


69420 
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tgcacgcccc 


aatccccact 


gacctttgct 


D 


tacctgaatt 


tggtcttggt 


atgtggtgcc 




ctatctagga 


gaaaaacatt 


cattaatctt 




attaacagta 


agcactcaaa 


agaggcctaa 


10 


atagaaacag 


aacatagcaa 


aaaaaaaatc 




gaagagattt 


taaaaccaca 


atatttatca 


1 D 


ccaagatggc 


caaataggaa 


cagctccggt 




acgggtgatt 


tctgcatttc 


catctgaggt 




cagtgggcgc 


aggtcagtgg 


gtgcgtgcac 


20 


gcctcacttg 


ggaagcgcaa 


ggggtcaggg 




acggacggca 


cctagaaaat 


cgggtcactc 




ttaaaaaacg 


gcgcaccacg 


agattatatc 




cggagtctcg 


ctgattgcta 


gcacagcagt 




ctgggggagg 


ggcgcccgcc 


attgcccagg 


30 


ctccaactgg 


gtggatccca 


cgacagctca 




ctctgggggc 


agggcacaga 


caaacaaaaa 




ccctgtctga 


cagctttgaa 


gagagcagtg 




agactgcctc 


ctcaagtggg 


tccctgaccc 




ccccccagca 


ggggcacact 


gacacctcac 


40 


gagggtcctc 


tctgttagaa 


ggaaaactaa 




catctgtaca 


tcaccatcat 


caaagaccaa 




aacagaacag 


aaaaactgga 


aactctaaaa 




gcagttcctc 


accagcaacg 


gaacaaagct 




gaggcttcag 


acgatcaaat 


tactctgagc 


50 


aggttgaaaa 


ctttgaaaaa 


aatttagaag 




agtgcttaaa 


ggagctgatg 


gagctgaaaa 


55 


gaagcctcag 


gagtcgatgc 


gatcaactgg 




tgaatgaaat 


gaagcgagaa 


gggaagttta 




gcctccaaga 


aatatgggac 


tatgtgaaaa 


60 


aaagtgatgg 


ggagaatgga 


accaagttgg 
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aggcataaac 


tggataaggc 


aagaatgttt 


69480 


cctggtacct 


ggcagaagca 


aaaccgaagt 


69540 


aataaattcc 


catcaaattt 


tccattgaaa 


69600 


aaagaaataa 


gacaatgcga 


ataacaacta 


69660 


acctgtaaat 


ttctgatact 


aagtatatta 


69720 


tgttttaaga 


aataaatgag 


gggggaggag 


69780 


ctacagctcc 


cagcgtgaga 


gacgcagaag 


69840 


acccggttca 


tctcactagg 


gagtgccaga 


69900 


cctgcgcgag 


ccgaagcagg 


gcgaggcatt 


69960 


agttcccttt 


ctgagtcaaa 


gaaaggggtg 


70020 


ccacccgaat 


actgcgcttt 


tccgacgggc 


70080 


ccgcacctgg 


ctcggagggt 


cctatgccca 


70140 


ctgagatcaa 


actgcaaggc 


ggcagcaagg 


70200 


cttgcttagg 


tatacaaagc 


agctgggaag 


70260 


aggaggcctg 


cctgcctctg 


taggctccac 


70320 


gacagcagta 


acctctgcag 


acttaaatgt 


70380 


gttctcccag 


ctggagatct 


gagaacgggc 


70440 


ctgacccccg 


agcagcctaa 


ctgggaggca 


70500 


acggcagagt 


actccaacag 


acctgcagct 


70560 


caaacagaaa 


ggacatccac 


accaaaaacc 


70620 


aagtagataa 


aaccacaaag 


atggggaaaa 


70680 


agcagagtgc 


cactcctcct 


ccaaaggaat 


70740 


ggatggagaa 


tgactttgac 


gagttgagag 


70800 


tatgggagga 


cattcaaacc 


aaaggcaaag 


70860 


aatgtataac 


tagaataacc 


aatacagaga 


70920 


ccaaggctcg 


agaactacgt 


gaagaacgca 


70980 


aagaaagggt 


atcagcgatg 


gaagatgaaa 


71040 


gagaaaaaag 


aataaaaaga 


aatgagcaaa 


71100 


gaccaactct 


acatctgatt 


ggtgtaccta 


71160 


aaaacactct 


gcaggatatt 


atccaggaga 


71220 



-256- 



WO 02/062946 



PCT/US01/48437 



acttccccaa tctagcaagg caggccaacg 
aaagatactc ctcgagaaaa gcaactccaa 
5 aaatgaagga aaaaatgtta agggcagcca 

agctcatcag actaacagtg gatctctcgg 
ggccaatatt caacattctg aaagaaaaga 

10 

aactaagctt cataagtgaa ggagaaataa 
attttgtcac caccaggcct gccttacaag 
15 ggaacaactg gtaccagccg ctgcaaaatc 

gaagaaactg catcaactaa cgagcaaaat 
attcacacat aacaatatta actttaaatg 

20 

acagactggc aaattggata aagagtcaag 
atctcatgtg cagagacaca cataggctca 
25 caaatggaaa acaaaaaaag gcaggggttg 

aatcaacaaa gatcaaaaga gacaaagaag 
aacaagaaga gctaactatc ctgaatatat 

30 

taaagcaagt cctgagtgac ctacaaagag 
actttaacac cccactgtca acattagaca 
35 cccaggaatt gaactcagct ctgcaccaag 

acgccaaatc aacagaatat acattttttt 
accacatatt gggaagtaaa gctctcctca 

40 

actatctctc agaccacagt gcaatcaaac 
actgctcaac tacatggaaa cagaacaacc 
45 aaatgaaggc agaaataaag atgttctttg 

agaatctctg ggatgcattc aaagcaggat 
acaagagaaa gcaggaaaga tccaaaattg 

50 

acaagcaaga gcaaacacat tcaaaagcta 
cagaactgaa ggaaatagag acacaaaaaa 
55 ggttttttga aaggatcaac aaaattgata 

gagagaagaa tcaaatagac gcaataaaaa 
cagaaataca aacgaccatc agagaatact 

60 

atctagaaga aatggataaa ttcctcgaca 



ttcaaattca 


ggaaatacag 


agaacgccac 


71280 


aacacataat 


tatQacrattc 


accaaagtgg 


71340 


aaQaoaaacfQ 


ttqqqtaacc 


ctcaaagqga 


71400 


caQaaaccct 


acaaqccaqa 


aqaqaqtqqg 


71460 


attttcaacc 


caciaatttca 


tatccagcca 


71520 


aatact t tac 


agacaagcaa 


atgctgagag 


71580 


aactcctaaa 


Qcjaacjcac ta 


aacatggaaa 


71640 


atcrccaaaat 


gtaaagacca 


tcgagactag 


71700 


aaccacrctaa 


catcataatq 


acaggatcaa 


71760 


taaatcraact 


aaatqctcca 


attaatagac 


71820 


acccatcaqt 


gtgctgtatt 


caggaaatgc 


71880 


aaataaaaQQ 


atQQacjQaaQ 


ttctaccaag 


71940 


caatcctaqt 


ctctgataaa 


acagacttta 


72000 


qc cat tac at. 


aatggtaaag 


qqatcaattc 


72060 


atacacccaa 


tacaqqaqca 


ccaagattca 


72120 


acttacrac tc 


ccacacatta 


ataatqqqaq 


72180 


aatcaacQaq 


acagaaagtc 


aacaaggata 


72240 


caaacctaat 


agacatctac 


agaactctcc 


72300 


cagcaccaca 


ccacacctat 


tccaaaattg 


72360 


acaaatataa 


aagaacagaa 


attataacaa 


72420 


t aaaac t cacj 


qattaaqaat 


ctcactcaaa 


72480 


tactcctqaa 


tqactactqq 


qtacataatq 


72540 


aaaccaatqa 


gaacaaagac 


acaacatatc 


72600 


QtacraQQaaa 


atttaaagca 


ctaaatgccc 


72660 


acaccctaac 


atcacaatta 


aaagaactag 


72720 


ocaaaaacca 


acraaat aact 


aaaatcagag 


72780 


ctcttcaaaa 


aattaatoaa 


tccaqqaqct 


72840 


gactgctagc 


aagactaata 


aagaaaaaaa 


72900 


atgataaagg 


ggatgtcacc 


accgatccca 


72960 


acaaacacct 


ctacgcaaat 


aaactagaaa 


73020 


catacactgt 


cccaagacta 


aaccaggaag 


73080 
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aagttgaatc 
taccaaccaa 
aggaggaact 
tccctaactc 
ccaaaaaaga 
aaatactggc 
gcttcatccc 
atataaacag 
ttgacaaaat 
cgtatttcaa 
ggcaaaaact 
cactcctatt 
taaagggtat 
ttgtatatct 
tcagcaaagt 
acaacagaca 
gaataaaata 
acaaaccact 
catgggtagg 
tcaatgccat 
taaagttcat 
acaaaactgg 
aaacagcatg 
cagaaataac 
aatggggaaa 
gaaagctgaa 
aagacttaaa 
ttcaggacat 
aagacaaaat 
tcaccatcag 



tctgaataga 
aaagagtcca 
ggtaccattc 
attttatgag 
gaattttaga 
aaactgaatc 
tgggatgcaa 
agccaaagac 
tcaacaaccc 
aataataaga 
ggaagcattc 
caacatagtg 
tcaattagga 
agaaaacccc 
ctcaggatac 
aacagagagc 
cctaggaatc 
gctcaaggaa 
aagaatcaat 
ccccatcaag 
atgggaccaa 
aggcatcaca 
gtactggtac 
gccgcatatc 
ggattcccta 
actggatccc 
cgttagacct 
aggcatgggc 
tgacaaatgg 
agtgaacagg 



ccaataacag 
ggaccagatg 
cttctgaaac 
gccagcatca 
ccaatatcct 
cagcagcaca 
ggctggttca 
aaaaaccaca 
ttcatgctaa 
gctatctatg 
cctttgaaaa 
ttggaagttc 
aaagaggaag 
attgtctcag 
aaaatcaatg 
caaatcatga 
caacttacaa 
ataaaagagg 
atcgtgaaaa 
ctaccaatga 
aaaagagccc 
ctacctgact 
caaaacagag 
tacaactatc 
tttaataaat 
ttccttacac 
aaaaccataa 
aaggacttca 
gatctaatta 
caagctacaa 



gagctgaaat 
gattcacagc 
tattccaatc 
ttctgatacc 
tgatgaacat 
tcaaaaagct 
atatatacaa 
tgattatctc 
aaactctcaa 
acaaacccac 
ctggcacaag 
tggccagggc 
tcaaattgtc 
cccaaaatct 
tacaaaaatc 
gtgaactccc 
gggatgtgaa 
atacaaacaa 
tggccatact 
ctttcttcac 
acatcgccaa 
tcaaactata 
atatagatca 
tgatctttga 
ggtgctggga 
cttatacaaa 
aaaccctaga 
tgtctaaaac 
aactaaagag 
aatgggagaa 



tgtggcaata 
cgaattatac 
aatagaaaaa 
aaagccgggc 
tgatgcaaaa 
tatccaccat 
atcaataaat 
aatagatgca 
taaattaggt 
agccaatatc 
acagggatgc 
aattaggcag 
cctgtttgca 
ccttaagctg 
acaagcattc 
attcacaatt 
ggacctcttc 
atggaagaac 
gcccaaggta 
agaattggaa 
gtcaatccga 
ctacaaggct 
atggaacaga 
caaacctgag 
aaactggcta 
aatcaattca 
agaaaaccta 
accaaaagca 
cttctgcaca 
aatttttgca 



atcaatagct 
cagaggtaca 
gagggaatcc 
agagacacaa 
atcctcaata 
gatcaagtgg 
gtaatccagc 
gaaaaagcct 
attgatagga 
atactgaatg 
cctctctcac 
gagaaggaca 
gatgacatga 
atcagcaact 
ttatacacca 
gcttcaaaga 
aaggagaact 
attccatgct 
atttacagat 
aaaactattt 
agccaaaaga 
acagtcacca 
acagagccct 
aaaaacaagc 
gccatatgta 
agatggatta 
ggcaatacca 
atggcaacaa 
gcaaaagaaa 
acctactcat 



73140 
73200 
73260 
73320 
73380 
73440 
73500 
73560 
73620 
73680 
73740 
73800 
73860 
73920 
73980 
74040 
74100 
74160 
74220 
74280 
74340 
74400 
74460 
74520 
74580 
74640 
74700 
74760 
74820 
74880 
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ctgacaaagg 


gctaatatct 


agaatctaca 


atgaactcaa 


acaaatttac 


aagaaaaaaa 


74940 


caaacaaccc 


catcaaaaag 


tgggtgaagg 


acatgaacag 


acacttctta 


aaagaagaca 


75000 


tttatgcagc 


caaaaaacac 


atgaaaaaat 


gctcaccatc 


actggccatc 


agagaaatgc 


75060 


aaatcaaaac 


cacaatgaga 


taccatctca 


caccagttag 


aatggcaatc 


attaaaaagt 


75120 


caggaaacaa 


caggtgctgg 


agaggatgtg 


gagaaatagg 


aacactttta 


cactgttggt 


75180 


gggactgtaa 


actagttcaa 


ccattgtgga 


agtcagtgtg 


gcaattcctc 


agggatctag 


75240 


aactggaaat 


accatttgac 


ccagccatcc 


cattactggg 


tatataccca 


aaggactata 


75300 


aatcatgctg 


ctataaagac 


acacgcacac 


gtatgtttat 


tgcggcatta 


ttcacaatag 


75360 


caaagacttg 


gaaccaaccc 


aaatgtccaa 


caatgatagg 


ctggattaag 


aaaatgtggc 


75420 


atatatacac 


catggaatac 


tatgcagcca 


taaaaaatga 


tgagttcatg 


tcctttgtag 


75480 


ggacatggat 


gaaattggaa 


atcatcattc 


tcagtaatct 


atcgcaagaa 


caaaaaacca 


75540 


aacaccgcat 


attctcactc 


ataggtggga 


attgaacaat 


gagatcacat 


ggacacagga 


75600 


aggggaatat 


cacactctgg 


gaactgttat 


ggggtggagg 


gaggggggga 


gggatagcat 


75660 


cgggagatat 


acctaatgct 


agatgacgag 


ttagtgggtg 


cagggcacca 


gcatggcaca 


75720 


tgtatacata 


tgtaactaac 


ctgcacaatg 


tgcacatgta 


ccctaaaact 


taaagtataa 


75780 


taaaaaaaaa 


gaaaaagaaa 


aaggccatta 


aaaaaaaaaa 


gaaaaaaaaa 


gaaaggagat 


75840 


aagccaatag 


ttttccaagc 


tgacttagaa 


actcgacaaa 


aaaaatgggt 


atatccttac 


75900 


tatttgtcag 


cataacctac 


cttctcttat 


tatctgtcag 


tctgaaatac 


tcctcaataa 


75960 


taaaaatgtc 


aagacttgaa 


aaaaaaaaag 


aaagaaatga 


taaactttaa 


aatacctcag 


76020 


ggaatataaa 


ataaaagaaa 


gtgaactaga 


agattttctg 


tgtaaaacta 


ttatgtaccc 


76080 


ataaaaatag 


aaaattaaaa 


aatttaaaat 


tttaaaaaaa 


gatggtccca 


gtaaataaga 


76140 


gtttaaaaaa 


caaaacacaa 


ctgcattctg 


tttacaagaa 


acagctacac 


tagtaacaaa 


76200 


caaatagatt 


taaggcaaga 


agcattgcta 


aacgtataat 


aggggacttt 


ctataataaa 


76260 


agggtcgatt 


cacaagagaa 


atattacatt 


tctaggtttg 


aatgctccta 


ataacaactc 


76320 


aaagccaaaa 


ggaaaagcta 


actcagctga 


atgagtagag 


aaagctaaaa 


tcatcataag 


76380 


aaattttagc 


acacatttct 


cagcaactga 


aagaatatgt 


agacagacca 


taaaattaca 


76440 


gaattttgaa 


tacaattgat 


aaaattgacc 


taaatgtcat 


atatagaatg 


aacactacac 


76500 


V, > CL Q U» CJ. G1. *w CT. 


caattataat 


actatcaact 


aacattttta 

CA k*» L— V— 


attaacaaat 


aaaat t a tat 


76560 


gtatttatca 


tgtataacat 


attttgaagt 


acatatatgc 


attgtggaat 


gtttacatct 


76620 


agctaattaa 


caaaggcatt 


acctcccata 


gttatcactt 


ttgcagtgag 


aacacttaac 


76680 


attcactatc 


tttgcacttt 


taagaataca 


atagccatca 


ttctcagcaa 


actaagacag 


76740 



-259- 



WO 02/062946 



PCT/US01/48437 



gaacagaaaa 
catggacaca 

ggagggagag 

ttgataagtg 
catgtatccc 
aactatgggc 
tgatggaagt 
acccagaatt 
cagtggctca 
gtcaggagtt 
aaaattagca 
ggagaatcac 
tccagcctgg 
atgaaagtga 
aattagccat 
agtatattgt 
aaatccccaa 
cataaaaaaa 
gtgcttagag 
agtaaattat 
tgaaagaatt 
atgaaataca 
aactaataaa 
tcattatcag 
ttattaacat 
caaaactatg 
ggcttatatc 
gcaaagcctt 
actgaatact 
tgacatgaaa 



ccaaacacct 
gggaagggaa 
ctttagaaca 
cagcaaacca 
agaacttaaa 
accccgctgt 
gataagacaa 
ctaggcctgg 
ggcctgtaat 
tgagaccagg 
gggcatggtg 
ttgaacctgg 
gcagcagagc 
aataaaaaca 
gtgctgagcc 
ctgaaaacaa 
aactgttttg 
ttaaaaacaa 
gaaactttac 
ctaaatattc 
gagatatagt 
acacaaacca 
attggtaaac 
aaatgaaggt 
aaaaattatt 
caaaaccacc 
taataaaaaa 
gcaggagcag 
ttcagttcac 
acataaggac 



tatgttctca 
tatcacacac 
aatacctaat 
ccattgcaca 
gtaaaataat 
acaacagatc 
tagaattaca 
tgaaaatgtt 
cccaggactt 
ctggccaaca 
gtgggcagct 
gaggcggaag 
aagactccgt 
actgaaaaaa 
tacaagcctc 
tggaattgaa 
aaattctaag 
aataatagaa 
gacctcaaag 
ataccaagaa 
attctataat 
tggataagat 
ccctagatat 
gtatcactat 
tgatgcatta 
cgccatgact 
tcaaatacat 
actaatttct 
ttcccaactc 
attataaaat 



ctcataagtg 
caaggcctgt 
gcgtgtgggg 
tgtatacctg 
aataaaaaaa 
tcttgaactt 
tctttaaagt 
cttcaaaaat 
tgggaggcca 
tggtgaaaca 
ttaatcccag 
ttgaagtgag 
ctcaaaaaca 
ttgacagatt 
aacaaaattc 
ttaaatgtca 
ttatacaagg 
atatttatat 
gcaagtattg 
attagattaa 
agacataaca 
aaaataaagc 
ttaaaaagaa 
caatacaata 
gatgattgtt 
aacacaggaa 
aatttaaaac 
tccaaacatt 
acgagtattc 
atttaaaaat 



agagttgaac 
tggggggtag 
cttaaaacct 
tgtaacctgc 
gaatacaata 
attcctccta 
gctcaaacaa 
gaaagtgaaa 
aggtgggcag 
ccatctctac 
ctactcggga 
ccgggactgc 
acaacaacaa 
tcacttagaa 
aaagaattga 
attttaaaaa 
ctcaaagaat 
aaattgatta 
ggaaggaaaa 
aaaattaaat 
aagaatagag 
taaaatatgg 
aaaagagaaa 
gccaagaaca 
tccttataaa 
aaaatgtata 
tttctctcaa 
tagaaaagaa 
atgtgaggct 
tacgggtaac 



aatgagaaga 
tgggacaagg 
cgatgacagg 
acgtcctgca 
tactgtcatt 
tctaactaac 
aataatgcca 
tagccaggcg 
atcacttgag 
taaaaataca 
agctgaggca 
accactgcac 
caaaaacaca 
cattttcaag 
aatcatatac 
ggcaaaaaaa 
ttacaacaga 
caactaaagt 
tagactaaaa 
ttaaagaaaa 
ataaaaacta 
atgtttgaaa 
acacaaacta 
tatgaggaaa 
acctacctac 
tatctgattg 
aaaagaaaaa 
acaacaccaa 
gtcacaagtt 
tctatcttgt 



76800 
76860 
76920 
76980 
77040 
77100 
77160 
77220 
77280 
77340 
77400 
77460 
77520 
77580 
77640 
77700 
77760 
77820 
77880 
77940 
78000 
78060 
78120 
78180 
78240 
78300 
78360 
78420 
78480 
78540 
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aaactttgtg aagaaatctg aaataaaata 
ataatatacc gtggccaagt cagtggatat 
5 catcaatcag tgatacacct aaatgtgaaa 

tgcagggaac ttaggagaat atcttcacga 
aaacataaaa gggactaaat ataggggaaa 

10 

aaagggagaa aatttcctct ctctatatat 
aatatattaa gcactcctga aaatcaataa 
15 gcaagcagtt cacaaagaaa gatagataaa 

cattattaat catcagataa atgcaaatta 
ccagaatcat tttaattaaa aggaccacca 

20 

ctggaactct catacccttt tgatggaaat 
aaaactattg ggtggttttt taaaaagtta 
25 agacaaaaat gtttatagca gtcctattca 

tctatcaaca acaaaatggc taattattca 
gaacaaccta ctgacacaga ggagcatgaa 

30 

aatttcagat aaaaaataaa caggcaaatt 
gttgactctc aatagggtgg gaggattgat 
35 tgataaaaat gtgatatatc ttgacgtgga 

tttttttttt gagatgaagt ctcactctgt 
cggcttactg caacctccat ctcccaggtt 

40 

tagctgagat tacaggctca tgctgccatg 
tggggtttca gcatgttgcc caggctggtc 
45 cctcggcctc ccaaagtgct aggattacag 

cttctttata tataaacttt gatatagtta 
acagcaacat tttcatgtgt ttatatacag 

50 

aaattttctt tttaaaagat aaagaatatg 
tggccaaaca tacaataatg ctccaattca 
55 acaatgaact aatatttaca tcactagact 

tggtatttat cactggtgaa aacaatgttc 
cactgccatt ttgggaagca aggatacaac 

60 

ttaacccaga tgtctatctc tacagtaatt 
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ttaacaaaat 


caaattcagc 


ataatatatg 


78600 


cagaagtaca 


caattgattt 


cacattcaaa 


78660 


tcctaaacta 


taaagcttga 


gtgggtgggc 


78720 


ccatggtagt 


agacaaagct 


ttcttagata 


78780 


aataaaataa 


aatggactac 


caaagcacag 


78840 


atctctctct 


cacaaaagac 


ccacatccag 


78900 


taaaaaggcc 


aaaaaaagat 


taagacttga 


78960 


tggctagtga 


gcatatgaga 


acgtgctcaa 


79020 


aaatcacaat 


aagacactac 


tccacacaca 


79080 


ataccgggtg 


ttggaaggaa 


tgtgaagcaa 


79140 


gggaaacgga 


aaatggttca 


accaccttgg 


79200 


atcatacacc 


tactttatgt 


ctcatgatat 


79260 


ttatagttaa 


actgataagg 


aactcaaatt 


79320 


aataatagaa 


tacttctcag 


caatataaaa 


79380 


tgaatttcaa 


tttattatac 


cgaggcaaac 


79440 


taacacatga 


tgctagaaat 


cagaagggtg 


79500 


tgggaaggga 


cacaagagaa 


cctactgggg 


79560 


aagaaaaaga 


gataaaagac 


atttcttctt 


79620 


tgcccaggct 


ggtgtgcagt 


gatgtcatct 


79680 


caagtgattc 


tcctgtctca 


gcctcccgag 


79740 


cccagctaat 


tttttgtatt 


ttagtaaaga 


79800 


tcaaactcct 


gagctcaggc 


aatccaccag 


79860 


gcatgagctg 


gcgcaaccag 


ccaaaataca 


79920 


aaaatggaaa 


atatagtttt 


aaagcagata 


79980 


agtgcatata 


actgacaaga 


agtcaagaaa 


80040 


aggaaagaat 


ttacaaaaag 


gaaactacaa 


80100 


ctaatagtca 


gggaaatgtc 


aattaattta 


80160 


ggcaaaaat t 


aatqaqttat 


aatttatggc 


80220 


ttgtatatca 


ggactagaag 


tgtgcaatgt 


80280 


atttacctac 


ctgaatgata 


aacagacctt 


80340 


aaaccacctg 


tatataatgg 


tatatgtaca 


80400 
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acaatgctat tgcagcattg ttcagagggg 
aaagaggaaa tcaatcagtt cctgtcacaa 

5 

atgaaatatt aagtggccat taaaaaccat 
ttaaaatata gtagacaaat aaggcttcag 
10 acctaagtat aaagtaaagc tctcagaaga 

gcagcctctg ggctactgac aagatgtgta 
caaaatattt tgaaggtctt ttgctgtgat 

15 

gtggctcatg cctgtaatcc cagcactttg 
caggagttca gcgctgccaa catggcgaaa 
20 ccgggtttgg tggtgggcac ctgtaatccc 

gcttgaacct gggagttgga ggttgcagtg 
ggctgataag agtgaaactc cgtctcaaaa 

25 

aacggtgaac tactagcgta tgcttcttat 
atagttttct tttttccctc tttttctttt 
30 ccttctttct ttccccttca ttagaccacc 

catttataaa atggactatt tcctacatat 
tctctaaaaa gataatatgc tatgctatgc 

35 

tatgctatcc tatgctatgc tagctatgct 
tgctatgcta tgctatgcta tgctatccta 
40 tcctatgcta tgctatccta tgctatgcta 

tactatgcta tgctatccta tggtatccta 
tgctatgcta tgctatgcta tgctatgcta 

45 

tgcaatgcag tgctatgcta tcctatgctg 
tgctatgcta tgctatccta ttctatgcta 
50 ctgaaaatcc ttcactgatt tgattcctaa 

gcttttatta tctgggcatc actatgtcat 
atcatgactc ttcatttttt tttcttcttc 

55 

tgactttcag cattattgaa ttattcattt 
cagaggattg tgattgtgaa caatgttgtt 
60 aaatattatg tagcaatgac ttctgcttgt 



taaaacaaac 


aaacaaaaaa 


acacagaaaa 


80460 


aagaataaat 


gaatgatgac 


aaactacacc 


80520 


gtggctctaa 


aatagctctt 


caatctatag 


80580 


gagaggatgt 


taaccatatt 


aacctttgca 


80640 


ctgcaaactt 


taactggtaa 


gtgcaggtct 


80700 


tctctatttg 


tgctcatagt 


tgccaaaata 


80760 


tgaaaaaaac 


acaggtgggg 


ccgggccatg 


80820 


ggaggccaag 


gtgggcagat 


cacctgaagt 


80880 


ccccgtctct 


actaaaaaca 


caaaaactag 


80940 


agctactcgg 


gaggctgagg 


caggagaatc 


81000 


agccgagatc 


ctgccattgt 


actccagcct 


81060 


aaaaaaaaaa 


gacaaaaaag 


aaaccaaaaa 


81120 


gtctctttcg 


ctttctcttt 


tactaagcca 


81180 


ctatttttag 


ttctttttcc 


tgtttcttca 


81240 


ttttatatat 


gcatttcttt 


gcattttatg 


81300 


ataaatcaaa 


aatgtatttt 


tatatgcctt 


81360 


tatgctatgc 


tatcccatgc 


tatgctatgc 


81420 


atgctatgtt 


agctatccta 


tgctatccta 


81480 


tgctatccta 


tgttatgcta 


tgctatgcta 


81540 


tgctatccta 


tgctatgcta 


tgctatccta 


81600 


tgctatgcta 


tgctatgcta 


tcctatgctg 


81660 


tgctatgcta 


tgctatgcta 


tgctatgcta 


81720 


tgctgtacta 


tgctatgcta 


tcctaggcta 


81780 


tgctatccta 


tgcttaaata 


gatgcatact 


81840 


gcatcagata 


cacttgagat 


atttggggaa 


81900 


tataatcagg 


aaaaccgtta 


ctgtgccaac 


81960 


ttctattaca 


gggaattttg 


attacattga 


82020 


ctctgccttt 


ctcaattttg 


gggtgccact 


82080 


gactagcact 


gtgagaataa 


agatgtgtta 


82140 


aagggaaagc 


caaaggacac 


atagaatctt 


82200 
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aaggctggga 


actctttgaa 


gcaaagggga 


gtggctcaag 


tgaaggggtt 


ctggggacat 


82260 


tttgactgat 


aaagcacatt 


ctgcctcttg 


gcaatgccat 


ctgcacctct 


ccccaaatca 


82320 


aatggtcacc 


ttggtgttgt 


cacctctgtt 


agtcatgcat 


tatttcaact 


aaaggacagc 


82380 


aatgtggtgt 


tgcaaagata 


ggaaccaaag 


cccaaggcct 


ggggtgaccc 


agtcatctaa 


82440 


cttctctggt 


tgatgacatc 


tcctgttgat 


ttcactaaga 


gactccaccc 


agtggctcct 


82500 


actccagtgg 


tgtgtggaaa 


gatgagctgg 


ttggtggaat 


ggtttggccc 


ataaattcta 


82560 


cattaatgct 


gcaagatcat 


caatgttaaa 


gagctgatca 


actagaatac 


taacaatagt 


82620 


aaaaataatg 


aaggcaaata 


ttggtggaac 


cactactata 


ttccaggcat 


cattc'taagt 


82680 


attatatagg 


tgttactcat 


ttaatctcac 


aaaagtccta 


ggaggtgaat 


tgtattatta 


82740 


tacccatttt 


acagaaaagg 


aaactgaagc 


acagaggggg 


taagtaattt 


gctcaaggtc 


82800 


ataaagtttg 


caatggcaga 


gccatagtat 


caacaggcag 


ccggttccag 


aatccaaggt 


82860 


cttaattcca 


atgccaagca 


aagtttggaa 


ggaactactt 


tcctctggat 


tctcttagca 


82920 


gcttttgcat 


tcaatctgtt 


atgatatctc 


gtgttctgta 


gcctatggag 


tatactactg 


82980 


taaacttgtg 


aaaagagaaa 


taaggtctta 


gtattattac 


aaaaatagtt 


ttgaacttat 


83040 


agggcaccta 


caaaggtctc 


aaggacccca 


ggaaatccga 


gaccacactt 


tgagaaccac 


83100 


taccttatat 


ggtattgcct 


ctcactggct 


gaatacctcc 


taagtattag 


aggtctcacc 


83160 


tgtatcatct 


cacttgtcct 


gacagccctc 


agggtagata 


tgtcttttgt 


catagtcata 


83220 


aaaattgaag 


ctcaaagtgg 


ttagcttttc 


tgagagcaca 


aaaggagcat 


gatttagaaa 


83280 


gaggatttga 


gtcaagttcc 


ctaacttcaa 


atcccattat 


cctatgttgt 


ttatggtata 


83340 


gaaaaaaaaa 


actcagcata 


ctcaaaaaat 


gaagtaacag 


gctttcattt 


ctactatttg 


83400 


ctggcagcag 


cttaggttat 


aggtgcattt 


gtgaaggtga 


cggatacaac 


atggcaggaa 


83460 


aaataagtaa 


gttttttact 


tcagtgaagt 


taggttatga 


tgaaaaatac 


ttggcaactg 


83520 


aatgatgtac 


agatgtgtga 


ggcttgagat 


atatttgagg 


gtgcaaagtc 


ttgggctttt 


83580 


aagtcaacta 


ggtttaaatg 


ttgtcttgaa 


gtcaaaaaat 


agtagatgca 


agttgtggag 


83640 


aaaagggaat 


gcttacacac 


ttctgggggg 


aatgtaaatt 


agttctgcca 


ctgtaaaaaa 


83700 


agcagtttgg 


caatttctca 


aagaactcaa 


agcaaaatta 


ctattcaacc 


cagcaatccc 


83760 


attattgggt 


acatacccaa 


aggaacataa 


atcgttctac 


cataaacaca 


tatgcacgta 


83820 


tatqt tcatt 


gcagcactat 


ttgcaatagc 


aaaaacqtqq 


aatcaaccta 


aatqcccatc 


83880 


aacagcagac 


tagataaaga 


aaatgtggta 


catatacacc 


atggaatact 


acatagccat 


83940 


agaaaagaat 


gagataatgt 


cctttgcagc 


aacatagatg 


aagctggagg 


ccagtattct 


84000 


aagcaaacta 


acacagcaac 


tgaaaagaaa 


atactgcatg 


tgttctcact 


tataagtggg 


84060 



-263- 



WO 02/062946 

agctaaacat tgagtacata tggacacaaa 
gagggtggag gatgggagga gggtgaggat 

5 

tacctgggtg acaaagtaac ctgtatatca 
acacatctgc acatgtaccc tgaacctaaa 
10 aaaattatta gaaataaaga tagaagactt 

ttaaacagat cccagaacat gaatagtacc 
tattaagaat gttacataag tgtgacgaga 

15 

caaataacca acaaaaataa tgggatatag 
aaaaatacaa cttgatgaaa acttagtcca 
20 gaaacaaagg ttaatatatg aaaagcactc 

actctcctac cacaatggta tgagtgtaaa 
agtaaaggta aagatacaca tactttataa 

25 

caagagaaat ccttgtacat gtgcaacata 
tgcctgcaat acctcaaatg tccatggaca 
30 ctggaggacc acatattggt aaaaataaat 

tacacattta atgttgagta aagaaagcaa 
taaataatga ttagaaaaca caaaaactgc 

35 

gtctccctct catgctgagc cgaagctgga 
cctccctgcc tgattctcct gactcagcct 
40 gccacgcctg actggttttg gtggagacgg 

agcccctaac cgcaagtgat ccgccagcct 
gaatctcgtt cactcagtgc tcaatggtgc 

45 

ctcgctacaa cctccacctc ccagccgcct 
gcctctgccc ggctgccatc ccgtctggga 
50 cgtctgggat gtgaggagcc cctctgcctg 

ccgcccggac gccatcccat ctaggaagtg 
ctaggaagtg aggagcgtct ctgcccggcc 

55 

gccccgccga cccgtctggg atgtgaggag 
tggtgaggag catctctgcc cggccgcccc 
60 cagccgcccc gtctgagaag tgaggagcct 
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gaagggaaca 


acaggcactg 


gggcctactt 


84120 


caaaaaacta 


ctattgggta 


ctatgcttat 


84180 


aacccccgtg 


acatgccact 


tacctatata 


84240 


ataaaattta 


aaaaacaaag 


aagaaatgtt 


84300 


ctttacaaag 


tcaaggtagt 


aaagaatttc 


84360 


agcaagaact 


gatacaaccc 


atttagtcgt 


84420 


gcagctacaa 


atcaaagcct 


tttagcaaca 


84480 


aatatataaa 


atcctcctat 


acattaatag 


84540 


aagccatgca 


cagccatttc 


acaggagaca 


84600 


aatttcttaa 


gcaattgtgg 


aatgaaagga 


84660 


tgaagacaac 


cactttaaaa 


actattccct 


84720 


cccaacaatt 


ccactgctag 


ttaatatacg 


84780 


tgacacgtac 


gaatgtttcc 


agagcaggat 


84840 


gagaacagat 


caattgtacc 


atgttcccta 


84900 


tacttgaagc 


aaatcaacaa 


agatgaatta 


84960 


gtcacaaaag 


aacatcttat 


taattctacc 


85020 


tctctccctc 


tccctctccc 


tctccccgca 


85080 


ctgtactgct 


gccatctcgg 


cttactgcaa 


85140 


gccgagtgcc 


tgcgattgca 


ggctcgcgcc 


85200 


ggtttcgctg 


tgttggccag 


gccggtctcc 


85260 


cggcctcccg 


aggtgccggg 


attgcagacg 


85320 


ccaggctgga 


gtgcagtggc 


gtgatctcgg 


85380 


gccttggcct 


cccgaagtgc 


cgagattgca 


85440 


agtgaggagc 


gtctctgcct 


ggccgcccat 


85500 


gctgcccagt 


ctggaaagtg 


aggagcgtct 


85560 


aggagcacct 


cttcccagcc 


gccatcacat 


85620 


gcccatcgtc 


tgagatgtgg 


gagcgcctct 


85680 


cacctctgcc 


cggccgcgac 


cccgtctggg 


85740 


atctgagaag 


tgaggagccc 


ctccgcccgg 


85800 


ctccgcccgg 


cagccacccc 


gtctgggaag 


85860 



-264- 



WO 02/062946 

tgaggagcgt ctcccggcag ccaccccgtc 
ccaggccagc cgccccatcc gggagggagg 
5 acccgtccgg gaggtgaggg gcgcctctgc 

cctctgcctg gcatcacccc gtctgggagg 
tgatgacgat ggcggttttg tggaatagaa 

10 

atcagattgt tgctgtgtct gtgtagaaag 
gtactaagaa aaattcttct gccttgggat 
15 tgtgctctct gaaacatgtg ctgtgtccac 

gatgtgcttt gttaaacaga tgcttgaagg 
cctaatctca agtacccagg gacaaaaaca 

20 

gaaagccaga gacctttgtt cacttgttta 
tgaccctgcc aaatccccct ctgtgagaaa 
25 gagactataa acataatttg agtgtttaga 

taaataaata aaaataaaat taaaaaaatg 
aaagctggag aaaataaata ctaaagtgtt 

30 

aattagagga ggatgtgaaa gcacagcgta 
ttgaggtgaa agagaaataa ccataaaagg 
35 attggtttgg agaacggaaa taaaatctac 

gcacacaggt tggagcagag aaagaggaaa 
cattgatacg ttcttgccta tctctcctcc 

40 

acctgtgtgg gttctgttct ctagattgga 
gaggccagtt gaaaaatgag ggagaataaa 
45 agaagaggaa tgaaaaagta agttggttgc 

aaaaaaaaag ctaatgctcg ctgggtgcgg 
gaggccgagg caggagaatc acctgaggtc 

50 

tgaaacccca tctccactaa aaatacaaaa 
atcccagcta ctcggggggc tgaggcagga 
55 cagtgagccg agatcgctcc attgcactcc 

taaaaaaaaa aaaaaaaaaa gctaatgcta 
ccattccatg aacatgggcc caaaaggacg 

60 

atgtggctac cacagggaca acatagaaca 
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cgggagggag 


gtgggggggt 


cagtcccccg 


85920 


tggggggtca 


gacccccgcc 


cggacagccg 


85980 


ccggccgctc 


ctactgggaa 


gtgaggagcc 


86040 


tgtacccaac 


agctcattga 


gaacgggcca 


86100 


aagggggaaa 


ggtggggaaa 


agatagagaa 


86160 


aagtagacat 


gggagatttc 


attttgttct 


86220 


gctgttgatc 


tatgacctta 


cccccaaccc 


86280 


tcagggttca 


atggattaag 


ggcggtgcaa 


86340 


cagcatgctc 


gttaagagtc 


atcaccactc 


86400 


ctgcggaagg 


ccgcagggtc 


ctctgcctag 


86460 


tctgctgacc 


ttcccttcac 


tattgtccta 


86520 


cacccaagaa 


tgatcaataa 


aaaaaaaaaa 


86580 


aatttaaaaa 


taaaatccca 


gtaaaaaaaa 


86640 


cagtatgata 


ttgcaatgat 


tgagcattag 


86700 


tgggagggtc 


acaataggct 


tcccagtaaa 


86760 


ttccaaactt 


tacagttgca 


tttacatctc 


86820 


actctggaac 


tacataaaga 


tacaagacaa 


86880 


ctataaaatc 


caggtaaata 


tctgatactg 


86940 


catagaggtg 


ccaaaggaac 


aaagtaagtc 


87000 


aaatcaatgg 


gcacaaactg 


tggctggtct 


87060 


gggatgaaga 


caagttcttg 


actctatgtt 


87120 


accatgaacg 


aaacaagaaa 


gaaacaaaac 


87180 


tgtacaagag 


ataaaacgga 


ctgcaaaaag 


87240 


tggctcacac 


ctgtaaaccc 


agctctttgg 


87300 


aggagttcga 


gaccagcctg 


cctaacatgg 


87360 


attagctggg 


cgtagtggca 


ggtgcctgta 


87420 


gaatcacgtg 


aactcgggag 


gcggaggttg 


87480 


dycccyyy uy 
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agaaaggtga 


ataataggtt 


caactgtatg 


87600 


a ggccatgcc 


ccactgggca 


cttgatgatt 


87660 


cagaagtaat 


actggagccc 


ttggctttgt 


87720 
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aacacatttt aattaaggta cttctttctg 
gggaaagaac attcattttc accagatcag 

5 

ctgtggatgt actgggtgat tttgtgaaaa 
taaagtactc ttgaaaataa taatcatctt 
10 atgaggtaga aagaaactat aaagaagtaa 

atcacagagc aaattgctga ggttttctct 
tcagagctcc aatattttta tgtgttcctt 

15 

tcttattttt aaaaaaagga caactagggg 
tcctactaag tcaataggtg gaatctataa 
20 tgtatcattt acagaattgt cttcatttca 

gctctgtctt taattccttt tataaaacca 
tcctaataac tatagaccta gccatgaata 

25 

tagtccagaa actaaacagc agatatgaaa 
attgagtttt cctcataaaa tttctcgaga 
30 agaatgtaat gaaatcatat acatgcagaa 

taaaagctca aaaaaattct tgttctggtt 
tttctttgct cttgaaattg accctgttca 

35 

atccactgcc taaaacaaga tagaatacct 
tttaagctct tgccatgtgc taggcactgt 
40 tgaactttag actagcctcc ttaggtacct 

taacagggat ttcattaaga aacttgctta 
aagattagaa aagagggctt gagcaatatt 

45 

taacctcaca ttctcaaatg ctattgagag 
atggacttgt ttcctttttc cttgtttaaa 
50 attttttttt ttttggagat ggagtctcgc 

gatctcggct cactgcaagc atccacctcc 
tccaagtaac tgggaataca gggggacacc 

55 

tgtattttta gtagagacgg ggtttcacca 
ctcgggatct gcccgcctcg gcctcctaaa 
60 aaattcctct gtaatcagat atttcagaaa 



acaagaaaac 


aattaaaatt 


aaaataggta 


87780 


cattttgtga 


aaggtgatat 


aagtgtatac 


87840 


tgattcggtt 


tcatgttttg 


ctttatgcta 


87900 


cactctgtca 


aaagtgaagt 


tagcatgaaa 


87960 


atttttccca 


tttttatttg 


aaataatgta 


88020 


aattgcttta 


gtcttatact 


tttgatctca 


88080 


gttcttaaaa 


ataattgcag 


ttgttctaat 


88140 


taaatatttt 


gtcaactgtc 


aagagaacaa 


88200 


aaggagtcta 


tttattagtt 


tcccttctca 


88260 


attgaataat 


taaagaatca 


gaactggctg 


88320 


gaacttgacc 


cagtaaaaat 


gtattctgag 


88380 


atctgggttc 


tatgtgaaat 


ttacagatct 


88440 


ggagaaactg 


ttgataaata 


aatgaaaagt 


88500 


caaaaaaaaa 


gcaatgatca 


acagcaaagg 


88560 


acgtggacaa 


ttctttaatt 


gtatgctttt 


88620 


aattactgta 


ctctgagtac 


ccatctccta 


88680 


attcctttta 


taaacttttc 


ctatggccac 


88740 


tgggatctaa 


taataaaata 


atagttcctt 


88800 


tccaacagtt 


ttgcataaat 


taggataatt 


88860 


tctgtcattt 


tacagttgag 


acataggaac 


88920 


aagtcccaga 


cttagtgaag 


tagcagattg 


88980 


tggccaccga 


aaccagatcc 


ataggatgta 


89040 


ttacgtggtg 


ttactccaag 


agtcacagct 


89100 


ataaactttt 


gtgggctcaa 


atgtgtttac 


89160 


tctgtcaccc 


aggtgggagt 


gcagtggcac 


89220 


cggggtcaca 


ccattctcct 


gcctcaacct 


89280 


cctacacccg 


gggaaatttt 


tttttttttt 


89340 


tgttagccag 


gatggtctcg 


atctcctgac 


89400 


gtgctggatg 


tgtttccatt 


ttgaaaaaaa 


89460 


gatgttttct 


cccttttcat 


tgagttttca 


89520 
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accattttca 


gttgcatcta 


gtatcggagg 


gtaagggtta 


tgagtacagg 


ttctggatta 


89580 




aattctctca 


ttgtgaaatg 


gggattttag 


tagagtacct 


atctcatggg 


gttctgtaaa 


89640 


5 


gggtacagag 


ataatgtatc 


tcatgcactt 


aacatagaaa 


taataataga 


gccaaccctg 


89700 




taatcggcac 


aattccacat 


actttgcaga 


gaataactct 


ttaatactca 


aactgactct 


89760 


10 


ctaaggaagg 


tagtcttatt 


agctctattt 


tacagacaga 


aaacaaaggc 


aaagagaggt 


89820 


taacaaactt 


gccaaagaca 


caaagttgga 


gttagatccc 


aagttcatgt 


tcctaaacat 


89880 




gacactatac 


agaccctact 


gtgcctataa 


taagtcttta 


ataaatgtta 


gctctctata 


89940 


15 


cacctcacct 


aaagcttgct 


aaagagaatc 


cttactaact 


cactcaatca 


caaagaagtt 


90000 




aatccggttt 


ataaactcct 


ctccatggca 


tcagaagaca 


gaagttttgt 


tgttgggtag 


90060 


20 


ttgcaggagc 


agagtcaata 


ccctgtcacc 


aaggctggcc 


tggaaaacaa 


getgeaaatg 


90120 




tccagagtga 


tgctgcatgg 


agggggattc 


tgaaccaaca 


gccaagaatc 


ccattttcaa 


90180 




gtcctaactc 


tgccacatgt 


tttctgtgta 


accttgagca 


tgtcacttaa 


cgactcagct 


90240 


25 


tcaatttctc 


atctgtaaat 


caaggggaat 


gagacaggat 


aaatggaaag 


ggtcttttct 


90300 




accctaaaat 


ttctggattc 


tatgagcaca 


gggtctgcat 


atgtgtcctt 


gttagatacc 


90360 


30 


agtgatacat 


tgtaccaata 


ggttattcat 


agtattcata 


cttcatacca 


agaatgagac 


90420 




attcatcact 


ctaactttct 


tgatattcat 


aagcacaaga 


aaattgttgt 


gcttttgaag 


90480 




acctcaaaaa 


aaggaatcta 


aggctaaaga 


caaatttttt 


aagtgtcata 


atttttttca 


90540 


35 


agaattaata 


gtcattgttg 


tccatattca 


gttgtactgg 


cttatttcca 


acaaaatttc 


90600 




ccttgcctac 


atggggcctg 


agatgctcta 


agctttcaga 


aggacagaaa 


tgaataaaag 


90660 


40 


catttttcct 


aaagaacaga 


agaatcatct 


taagtgggca 


tatgtgaact 


atctagtttc 


90720 




ccaattactg 


gcaaacaatt 


ttaatcttgg 


actgccctcc 


ttgccataca 


agtattgtat 


90780 




tacttggccc 


ctgtattagt 


tttctatttg 


ttgatataac 


aaactaccac 


aaatgtagtg 


90840 


45 


gcttaaaaca 


atataatttt 


tttatctgac 


agttctgtag 


gtcagaattc 


caataccagc 


90900 




tcactagact 


aagttcaagg 


tgtcagcaga 


gtcatgttcc 


ttcctggaga 


ctctagggaa 


90960 


50 


gaatctgttt 


ccttgtttca 


tccagcttct 


agaggcgacc 


catacacctt 


ggttcatgga 


91020 




cgccttcctc 


caacttcaca 


gttggcagca 


tagcatctct 


ctaatgaggg 


atggtttcag 


91080 




gatgaaacta 


ttccacctca 


ggtcatcagg 


aattagagtc 


tcataaggag 


catgcaacct 


91140 


55 


S3 f% SS't"^T' , « r " , ^" , .S5.a 

ay a L L. L. l_t,cici 


' — ct L.y ty ^ay l. 


L» cl\^ ay l a y y 


y l_ i— \— ci v— y v*. \— y 


\ ci y ay ex. a. t_ 


a. a i_y cv i_ 


912 0 0 

J X ii u u 




gctggtctga 


taggaggcag 


agctcaggca 


gtaatgcttg 


cctgctgctt 


acctcatgta 


91260 


60 


gtgcagccag 


gtttccaaca 


gaccacagac 


aggtaccagt 


ccttggcgtg 


ggggttgggg 


91320 




acttatgaga 


atcatgtttc 


tatcagttgt 


tatggcccat 


tatctgetea 


cttgttctct 


91380 
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ccatttgcat 


agataattga 


catttgacat 


5 


gataaacgta 


tcacttttgt 


atgttttgtg 




caacaagcca 


tgctgaagta 


aatgaaacca 








l^ L , - l -yyy l- 


10 


ctcagcgtgc 


tcagccctac 


ggcatcacat 




gtcaaggaaa 


tatacaaatg 


ataaatccaa 


15 


aagaagcaaa 


ggcactaggg 


gtaagtctat 




aggtcttctt 


ataagttata 


ggagagtatc 




l uy a ci ex. i— a 




a u oy a a a l- a l> 


20 


catttcattt 


tagatcaagc 


aaaaatataa 




aatttggtaa 


tattgaegtg 


acagaagcac 


25 


aaggactttt 


tggacaatct 


caatcccatc 




taaaggtgaa 


tttacataac 


aatgaacaat 




ci Lyyy L- cty l. u 


L yyy actci yy ^ 


t-y «y ^y ^y aa 


30 


tctttcacca 


atacccttag 


agatgetaca 




gactctgaac 


tgactgeett 


gtttctgtag 


35 


aagaaaattc 


cacctgccta 


ggtagaattg 




tcattcatcc 


cgctagcaca 


attccaccaa 






L-y V L.y y ay oa 
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40 


agecatctta 


ctatgtattt 


caagttgtga 




tctcctgcag 


cagectattt 


actaagctgt 


45 


agatagagga 


aattccctgc 


tcctaataag 




cagccccaca 


caggtgcttg 


gaatatcatt 




a ^ CI *-j C4. Ca. L« l_ CI 
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50 


ttagtgtgtt 


agaaggtgtc 


gaatgatata 




aagtactagg 


cagtgaggag 


gcagcaattt 


55 


catgacattt 


gagcaaaaac 


ctgaagttag 




aaaacaacca 


gtgtaaaagt 


ctttagatga 




aggaggecaa 


taaaatagaa 


cagaatgagt 


60 


tgagtggcag 


aggecacagg 


tggtgaggcc 
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ttggtaagat 


tttctttagt 


ttcttttctg 


91440 


atacttagga 


cataatgatg 


tcatccaagc 


91500 


tacccaaccc 


ttacccacca 


ggcagcttta 


91560 


caatcaactt 


agaaaaccaa 


gctcagggtg 


91620 


etcegggaat 


etttgetage 


agtcaacegg 


91680 


gtgtgggaac 


agcagtaatg 


aactttaaag 


91740 


ttactaccag 


aattttaatt 


teacattgea 


91800 


atecaattge 


taaaaagtt.c 


tgaacgttat 


91860 


ttagtctgee 


tatagacttt 


cccttgagtt 


91920 


caagacacag 


aggagcctcc 


ctagctggag 


91980 


tgttacatct 


agcttgatct 


catttcttca 


92040 


tctggattag 


ttacaagtct 


ggtttttgca 


92100 


gatggctgee 


cccagaaaag 


gaaactacaa 


92160 


agaattgggg 


gacattccat 


ctgctatgcc 


92220 


gtttaactgt 


caccttactt 


ctacaaagga 


92280 


gcattgtaaa 


aacaaaaatg 


ccctacttgt 


92340 


ttagtccctt 


ctccgtgctt 


tccaaataca 


92400 


attatattaa 


gtctctccct 


gtgaatagct 


92460 


gaatgetcat 


tatgtgccag 


aaactcttct 


92520 


tgttttaggg 


aagtaaaaat 


gagcataagt 


92580 


ggaggtggcc 


cttctgatcc 


tcctcctccc 


92640 


ccaatgggac 


aaactaacac 


cccatacagc 


92700 


aagcacaaag 


gaggtgcaat 


ctctgccctc 


92760 


gactgtatac 


ataataagga 


agtaaagtac 


92820 


gacaaaacaa 


agaagtgtaa 


aagggctaaa 


92880 


caaggtggtc 


agaatgggct 


ttccttagaa 


92940 


gagtgagece 


tgcagatatt 


a ggg aaa g a g 


93000 


aaatgtcttt 


gaggtatttg 


aagaaccggg 


93060 


gagggagggg 


gcagttgtca 


gg a ggg ca 9 a 


93120 


tgcagagcag 


aaggctttgg 


ctttgactgg 


93180 
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gtggatgagg aaccactgaa gcatgctgcc 
tttaaaagaa tgtgaagttt tgttttcaaa 
5 tctgaagatt tgttatttct atgccatcag 

ccacaggtgg caggacttgg cacacctgta 
tggccaaaaa tgcagacaaa ttgtatgtcc 

10 

tgatcagtag atggtacagc cagggctgta 
gatctaactg cccctcatca aggtgacagc 
15 cggctgaatt gctgctctaa agaacagttg 

gttctttaag tgagataggc taaacgtggc 
tatggtatgt gtgtgtatga atgtgcaaga 

20 

tttattgatg tctcaagaaa cttttagcat 
taattcgatt tacttgtgtt cagtagctat 
25 tatttgcagt ctccctggat agataagatg 

acaatcaagt tttggaccaa tctagaaatg 
aatgaaggaa ggttcaataa gaggtacaag 

30 

aatatatgat tgataagaag aatgattggt 
aaaaaaggac aatatacact ttgggaggcc 
35 cagaccttcc tggctaaccc gtgaaacccc 

ccaggcatgg tggcgggagc ctgtagtcct 
gtgtgaaccc gggaggcgga gcttgcagtg 

40 

gggcaataga gcaagactct gtctcaaaaa 
tataaatgaa tccaaaatca agtaataagg 
45 ttcagaaaga aagaaaaaag attaatttgg 

gagcagagct gagcatgggg gaagacagta 
tgtgctgcag taaattccca gatctgcctc 

50 

atctttatgt ctctgggact caatccagtt 
aagtgttcag aacatgacca agcacataat 
55 aattccttta cccgtaactg ggatgaccca 

ctactgacat cacagttaaa gtattgtttt 
gctaatgcat gattaagcag ctgtactttc 

60 

attctccaaa tacctcaaaa cagggcgttt 
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cggcaccgaa 


gaggcgtgtg 


ataaaaatac 


93240 


atctccagtt 


ggcagaagct 


acttcctctc 


93300 


caggccctgg 


gcatcatgac 


agtgcagggg 


93360 


ccagtcaact 


cagaccttaa 


agtgcttctg 


93420 


aaatttttaa 


gtctgtccta 


taggcaattt 


93480 
« 


ggaggagaaa 


tgagggccag 


tacaaatcca 


93540 


aaggatgaga 


atgggaagtt 


tgtgaccaga 


93600 


taaaaaccaa 


atataaagtc 


taatacagga 


93660 


ccatcatctc 


tctgaagcta 


taagcaaaag 


93720 


ataatttttc 


agggagaaag 


gttcatggtt 


93780 


tgtttatgaa 


actgggtttt 


tttctaattt 


93840 


ttattgaaga 


aataaataca 


gcagtctttg 


93900 


taaccatggg 


gaaaaataaa 


ttacaaatct 


93960 


attattcctc 


agaaatctat 


ctttatttac 


94020 


gaggtaaaag 


atgtatacct 


tacttttagt 


94080 


aagaaaatga 


ttgcatacct 


gcaacttttt 


94140 


aaggcgggca 


gatcaagagg 


tcagaagatc 


94200 


gtctctacta 


aaaatacaaa 


aagaaattag 


94260 


agctactcgg 


gagggtgagg 


caggagaatg 


94320 


agccgagatc 


atgccactgc 


actccagcct 


94380 


aaaaccaaaa 


aaaaaccaaa 


aaaaaaacaa 


94440 


tgtgttttat 


gtcatgagat 


gcaactgaaa 


94500 


tgtaatcaag 


aagacttcat 


ggaaaaggta 


94560 


ccgaataatt 


cagacaggct 


cagcagtcaa 


94620 


ttaggaattg 


tgtgaccttg 


gatacctgat 


94680 


gtaggaggat 


tacatgaaac 


aataaatgta 


94740 


tcatgcctaa 


taaatggttt 


tatcttcacc 


94800 


«M ^ C*. ~H d CL NaX 
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94 860 


tctaatagtc 


ttttagaggc 


cctcagagat 


94920 


tgaagaattc 


ttcttgggcc 


accccagtaa 


94980 


cactaagggc 


cttgcactcc 


agttattcta 


95040 
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gtatatgtca gaatcttcat gacccacctc 
atggatctca gcaaaattga caataaataa 

5 

acgtctgaaa gtgaagaagt aattttaaat 
catggttgga ttgatgcaca ttggttttgg 
10 tagagaagta ttaggttttg cctctactgc 

cctttctgtg agtagattgc tagaacacca 
tggtttataa agtattgcta tctccagcca 

15 

catctgaggg ctggagaatt tagaaaaatt 
gaaaaattaa cagctctgaa tgagcatgaa 
20 aggagcttat tccattcttt aaaatttaat 

cctactacat ttgagatgga aatgtatgta 
gattggaaac atgattaaaa atatgcaagt 

25 

cccagctact tgggaggctg aggccaggga 
ctgggcaata tagtgagact caatctctaa 
30 attatgagaa gggatggagc agtaagcctc 

cttcataggc cagaaattaa ggaaaagaaa 
gagttactcc agaaaattca gaagtgcgag 

35 

ataaatagac caacctttca tataaagaga 
cacaaataaa tttgtccatt tccacagttt 
40 tccaaggagc tttcccgttg tctggtaagt 

aagattagct taacatattg gggaggaaaa 
ggagcacctt tcccaaaaaa aatctattgg 

45 

aaatggtcca cacagtttgc cactaagaaa 
tcacaaaggt gagaagtagt aaaaatcagg 
50 ggttcagtct ggtcacactc tattatggga 

tacatactag acacatcagt taccaagtct 
tcttgtattt ttattttttt atttgtgcta 

55 

acaattcttt aaattgccat taaatgagtc 
atgatgagta agacaggact ttgagctctg 
60 cgtggtctac tattgaacca ggaaaaagaa 
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ctgcttcaga 


tgtgtgcctg 


aatgacaacg 


95100 


gcaggggcag 


caatgacagt 


aatgtttcac 


95160 


ggctgttttc 


ttttttcagg 


tgatccagat 


95220 


aattgttttg 


tgtttaatat 


ccttctcttt 


95280 


tgttattggt 


ggatacccat 


tctggggtgg 


95340 


gtccttcttg 


gtttataaag 


tattccctct 


95400 


attcacaact 


catttcttga 


taaagcccta 


95460 


aacagctctg 


aatgagggct 


ggagaattta 


95520 


aaaatcggga 


actctttcgc 


cattttacct 


95580 


catgtaatca 


gcaagccaaa 


gttttgatat 


95640 


ttgggtggag 


agagaaagaa 


aagaaaaggt 


95700 


aagccaggca 


tagtggtgtg 


tgattgtagt 


95760 


attgcttgag 


ttcaagagat 


ggaggccagc 


95820 


aaaaattttt 


taattaaaaa 


aatgcaattg 


95880 


agaacaagga 


atgcttaaac 


attgtgagtt 


95940 


ataaaaaaga 


agtcaaggac 


atatgactga 


96000 


agaaaatttt 


gagaaggaaa 


ctagatgaac 


96060 


aattcccaag 


cacaaccagt 


taacccattt 


96120 


attatctctg 


gctctctctc 


tgtgtcagca 


96180 


tagactgtct 


ctactttttg 


aaccccttta 


96240 


ttgaaaagct 


ggatttggga 


aatgcaaaag 


96300 


ttgccttatg 


gtctgagcgc 


tggggtgaga 


96360 


tggcagagct 


ccatatcctg 


actcttgaat 


96420 


gttgtttaag 


ctttaaccta 


aaagctccct 


96480 


agtccaaagg 


agaatccagg 


agccaggaaa 


96540 


ctactgttca 


tttgccactt 


ctcagctctg 


96600 


tttcctactc 


ttacacaggt 


tgtgtgtttt 


96660 


tagcttgttc 


tgtaagaaga 


gtttataaaa 


96720 


tcggtcacag 


caccccacct 


attccaattt 


96780 


agtgatttat 


caagaactag 


agtcaactag 


96840 
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agcagaggtg 


gtgtcaagat 


tatatgtaat 




gaacacagaa 


tcatgtagct 


catgctcttg 


5 


aattaaaata 


aaataagcct 


aattctgatt 




taaagcaccc 


tttctctgca 


tcttttaagg 


A f\ 

10 


atagattcca 


cctcttagag 


gagacagcca 




agcttttaga 


gaacttcagg 


aggctctaag 




actcgactgt 


gccaccccta 


gcaagtcacc 


15 


catgttaaaa 


tcctattgta 


gaaactgggg 




ccacaaaagc 


ctaaattttc 


tccttctaaa 


20 


acttagggtt 


ttcattaatt 


cctaaatttt 




ccatctataa 


aatcatatgc 


tatcgtacta 




atttaaataa 


tttaacacag 


tgtctgacac 


25 


cattttttta 


taaagaaaat 


atagttgtta 




agatgagtgc 


ctgctttata 


tgatttacat 


30 


tctaggctgc 


atacagaata 


atgtttatgg 




atttcaactt 


gaaaaattgt 


tttgaaagtt 




ttcttatctc 


acttaacttc 


agtgatcctg 


35 


ttaggtgaaa 


ggcagcctgg 


gaatgaacat 




gattctgctg 


ctggtggata 


tgtgcatcaa 


40 


ggtaagtatc 


ccatactcca 


ccatgtgcct 




atgttttatt 


tcttaatacc 


gtataccagc 




cccaccccta 


gtcatatgat 


ctctaggact 


45 


gatagaccga 


ggcttaaagg 


acatccttta 




aatggggaag 


ggtttggata 


ggcctccatg 


50 


caacatttga 


aattctatat 


aaataaggta 




tcagagtaaa 


gtatgaagtg 


agagggaaat 




agaaacccag 


gaatgaagca 


attgctaaat 


55 


d \_» CL CI L~ V_» CL CL \^ 


attcattaac 


cttatataac 




gggtggccca 


agggcaggaa 


tgcttaccat 


60 


tgaggttggg 


ctggaagctg 


tcatcctacc 




attctgtgag 


caatggaaaa 


ccacagaaga 
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ccagtacaag 


aaagaagaac 


ggggaaagag 


96900 


cagacaaaga 


agaataaaat 


aattaaatca 


96960 


aaccactcct 


gttgcagtcc 


aaaaacaaaa 


97020 


atccttccgt 


ctaattgaga 


ttgtcgaagc 


97080 


ctgacacata 


cctaagacaa 


tttttctaag 


97140 


gcacgctgca 


gcacaagcac 


attctaaaga 


97200 


tgcggacttg 


gtttcacaat 


tattcttctg 


97260 


aaagttttgt 


agtcgcccat 


acacctctgc 


97320 


tttgggggtt 


catatttagt 


ccccagagag 


97380 


tcagcaaaca 


tgttgtgtct 


aaatactttc 


97440 


cctcattcaa 


aggctgtaag 


agttgagata 


97500 


agaaaaatgc 


tcattagaag 


tttgtttttt 


97560 


ttgtgcatga 


agcttttagt 


aaaatccaat 


97620 


ttttctacag 


ctttttagaa 


gtaccctaca 


97680 


taattaccca 


gaagggcaga 


aaacagaata 


97740 


caaaaattaa 


gaaccatgaa 


atttgattga 


97800 


tatataattg 


attttttcct 


gccatctcca 


97860 


tgttagttct 


atcttggcct 


tcattggagt 


97920 


tggggtagct 


ggccaagact 


actgggccgt 


97980 


gctctatttt 


cagtcccgta 


aagtgggtct 


98040 


tctagtggaa 


aatgtttttt 


ctattctagc 


98100 


atagaataaa 


gtgagagggt 


tacctctaca 


98160 


ttggcccatc 


tcactgttaa 


tcacatgaac 


98220 


gcttctaaaa 


ttgtccaaaa 


attattttcc 


98280 


gtcaagctat 


tgagtgtttt 


ttgttaaact 


98340 


gacagggata 


agaacatctg 


atttttacag 


98400 


gaagcacgta 


tctctgaatg 


tgctcaaaac 


98460 


• 

caaqcatqqa 


ctctgacctt 


aaqaqqaaat 


98520 


ctataaggaa 


tggagcaaca 


tggtataaaa 


98580 


tcaatggcca 


cattaaccat 


tttggtcttc 


98640 


gtggttttat 


tggtttcatt 


ttattttatg 


98700 
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ttggggttgg ttttgctttt gcttttcagt 
tgggctacat atatcttaag aaagacacag 

5 

aacattaaga caatatcaca atattttgaa 
gttcactcaa tactgtgtaa gtaattctat 
10 aagccatctg cttctcaact aaagggttct 

tgaaagttgt ctaggcactc ccatcagatg 
tcagtagttt gaattgctgg cacactcttt 

15 

aaaaacatag gagagtttta ggccgggaat 
caacactgtt tgcagtgtaa ggaaaagctt 
20 actgctgata aagaaatacc caagactgag 

ccacatggct ggggaggcct cacaatcatg 
tatatggatg ggggcaggca aaaaaaatag 

25 

aaccatcaga tctcatgaga cccattcact 
ccataattca atcatctccc accaggttcc 
30 caagatgaga tttgggtggg gacacagagc 

gaaagcttct cggtaaataa ggcagagtga 
caaacaaaag aacaaaaact attcaaaaaa 

35 

gaaacaaatc attctgatgc tactcaaagc 
cttgtttccc tacaaacaaa actgaggaaa 
40 tctcactaac actctccagt ctgtagctga 

tgaaaggata aagcaaatcc tgtatggtac 
agacagaaga aagggcaaca ccttcccata 

45 

atgagcagaa atgcacagga atgcacagga 
tactaagcat ttagcaggct ctcagaagac 
50 gaactgtgct cgaagccagc taagctatct 

ttcagaaaca tggatataaa acaaccgcta 
caaatgggag ccaaataaac agaagtgtca 

55 

atcaaataga aagcctgtat agagttcaag 
taacatgatt ccattcttat agaatgatgc 
60 tctcacggca tctttccaca catccattct 



ttacaaaaag 


cagagtttat 


taaattactg 


98760 


aatgacttcc 


ttatctatct 


agaacatatt 


98820 


caaataacca 


caataacatt 


tcctttatca 


98880 


ttcactggat 


ctcggagtgg 


caggcttatg 


98940 


gaaatcctga 


ctccaaccat 


tggtataatc 


99000 


cgtgtcccct 


agagtatgtt 


ctttgaagtg 


99060 


ttccaccaag 


ctctgagaca 


atcctttgtt 


9912 0 


ttgaatgatc 


acttttgtgc 


ttggaaagga 


99180 


atataagaag 


gtgtattagt 


ttgttttcac 


99240 


aagaaaaagg 


ggtttaatga 


gcttacagtt 


99300 


gcagaaggca 


aggaggagca 


agtcacatct 


99360 


agcttgtgca 


gggcaactcc 


tgttttttaa 


99420 


gtcacaagaa 


cagcatggga 


aagacccacc 


99480 


tcccacaaca 


agtgggaatt 


atgggagcta 


99540 


caaaccgtat 


caggaggcaa 


aaatagatga 


99600 


agcaagaaca 


aaattcaccc 


tccaaaatac 


99660 


aaacagcttt 


caaactggca 


tccaagtaga 


99720 


tgaaaaaaag 


cacagctttt 


agacctggct 


99780 


gaagggaagc 


caaaaaaaaa 


tactgagaag 


99840 


gaaggctggg 


gtggagagca 


gtcaattctg 


99900 


aagaccatga 


gtagtaccca 


ggacctcagg 


99960 


ccctgcatgg 


taccctacta 


ccagcaggga 


100020 


ataaacctcc 


agccaaggag 


caggcctcca 


100080 


agaaaaagtg 


aagtccatcc 


tcgctagcaa 


100140 


ctctccctgc 


aacagctata 


tatgttcaca 


100200 


gggctcaagt 


tgtaatggaa 


gcaaggaaga 


100260 


atgaaggaaa 


attcacctac 


actgtacctg 


100320 


tgagcttaat 


acttttattt 


aaaatagaaa 


100380 


tcctctctct 


tggttattct 


cttttttccc 


100440 


tctgcctttt 


ccctgtacac 


agagagctag 


100500 
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gataatagtc atcaaacatt aattttggtt 
gttggtttct gtattgcttg acttttttaa 
5 acgaggccat cacaaaggaa aaaaactacc 

tacttaccaa aaggaaagac ttctggcatt 
ctctggaaca agcatttgat gatactggaa 

10 

aatagcacat ttgatgagtc aataactttt 
caaaatttca ctatttattt atgatttcac 
15 tgatcttctc cctcttggag ttcttcgtag 

caaacaccac aaccaatatg gtgagttggg 
cctggagaaa tacagacttg tgtctgctaa 

20 

tgtccttcac tctgatggta ccaggggaaa 
gtttttattc attctttcca gtctgtcctg 
25 gtgacaccag cgtcttcttc agctcctccc 

aaatagtaaa agaaaaaggg gtatcagtct 
cttcttaaga agatgtcttt tattgtctac 

30 

gagatttgtt tttaggttgg tcgctaatga 
acattaccac tactacatgc tggcaaaggt 
35 caactctctt aaagttagaa atgtttctgt 

ttagaaacaa aagccttttt cagtatgttt 
tagagcagta gatctgatct atactccaag 

40 

ctaggtgtcc agtcactgga gttccctaac 
gcaagtgtcc aattgccctt gttggaggtc 
45 atcttgcctc ttaactatca tctacaaaat 

ttaaactcac ctgcaatggg acaacagtca 
ttgagttttg gctttttgtt tttgaattgt 

50 

aaggtaagac agaagggatg aatgacagtg 
aaaacatcac ctaagtcatg aaaaagggca 
55 tcacagatat ccacagcttt gcaatagtgg 

atccctgtca aagcagtagt gatgtcatgc 
gttgaacaga atggtaacaa gggaaaagtc 

60 

acaggattgc caagcggtac agaaggcaca 
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atttctaaac 


actggcattt 


ggaatgatcc 


100560 


tgagtagtaa 


catttgtatt 


aaaaaaaaca 


100620 


ttctttctaa 


tatttctaaa 


caaaatagcc 


100680 


ttaaagaaaa 


tccaaaataa 


attcacactc 


100740 


gaataagcat 


ggagcgagtt 


gagtataagc 


100800 


tgaagcccac 


gtgaggccct 


cctgattagc 


100860 


tttctggaaa 


aggcatttca 


gccacgctga 


100920 


cttgtgccac 


agcccatttt 


gccaaccaag 


100980 


tctcttcttt 


gtaaaataat 


ctgaaaatgc 


101040 


atcctactat 


cggcttacca 


gaatgctctt 


101100 


ttgcatagga 


ttgtagtaac 


catattgctt 


101160 


gttattccaa 


atatgtatga 


aagcaaccct 


101220 


agatgcaaca 


actactcagc 


taatgcccct 


101280 


aatctcatgg 


agaaaaacta 


cttgcaaaaa 


101340 


aatgatttct 


agtctttaaa 


aactgtgttt 


101400 


tggctgtatc 


tcccttcact 


gtctcttcct 


101460 


gaaggatcag 


aggactgaaa 


aatgattctg 


101520 


tcatattact 


ttttccttaa 


taaaatgtca 


101580 


aaaaaattaa 


ttttgatttg 


atattggttg 


101640 


aagattgata 


aacctttgat 


gatgattttt 


101700 


tcatctcctc 


ctcatctttc 


tgttcagaaa 


101760 


atattattct 


atacattaat 


atcatcaaac 


101820 


tttgctattt 


acagagtatt 


ttcatgtaca 


101880 


gacagggatg 


ctgcattgag 


gggttttcat 


101940 


tggcacattt 


ataagccaat 


tagaacaatc 


102000 


taagtttcaa 


gaaatcaaaa 


gcacagatgg 


102060 


gtttctctac 


aaaatagaaa 


ggtataatta 


102120 


caa tciacrcja t 


cccratcrtciat 


qqctcatqtc 


102180 


ctaaggtaag 


ggcacaggta 


gagtcagaat 


102240 


atggaactgg 


agaataaagt 


ggtcggagaa 


102300 


tttgaggatg 


tggtaacaaa 


ttcagagtat 


102360 
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ggacaatgct 


tctgcaaggc 


cttgctgcta 


5 


tttgattttt 


tccagggtta 


agattttgct 




gaaagcttag 


gacattgtaa 


aaaatattat 




ataaggaagg 


a a uaanagca 


ataa uaaaag 


10 


taaatggaga 


aatctgcctt 


ttctccggga 




atatgaatat 


ggtttaaaaa 


tagcaaagta 


15 


aagaagccag 


aagggaaaaa 


tataacaaaa 




gaggaagatt 


agaagaacag 


aatttttgca 




agagccacat 


aau c ccaacg 


4— ^-i —5 +* 4™ /~1 +- 

ucatuggcct 


20 


gattggaaat 


gctgtgtcct 


tcaccataaa 




gatttcatgg 


tctaagggtc 


tggaaaataa 


25 


gtatattgta 


tatctataca 


cgtaaaataa 




cagaagcatt 


tccagcaagt 


tcattttt.ta 




tcaaattgct 


taataat age 


aacaauaaca 


30 


tcacgttcca 


agcactgttt 


caatgctcat 




ttcagatgaa 


aaaactttgg 


cagaaaagtt 


35 


aaatggtgaa 


gcatggattt 


gaacccaggc 




taaccattat 


tcccactatt 


cttatggtct 




acaaacagaa 


atgtctcccc 


acaguaagct 


40 


ctctgagaga 


ggagatttag 


taacataata 




gaactgcgtg 


gctccactaa 


tgtgcatatt 


45 


agaataaact 


gatcagagaa 


acagaggatt 




ggttttagca 


aatttatact 


atgaacaaat 




tagaaggtga 


c tggaauaga 


aatacgatat 


50 


ccctcatgcc 


cagccgaggc 


tggactgtac 




tgcctgattc 


tcctgcctca 


gcctgccaag 


55 


tctgactggt 


ttttgtattt 


tttggtggag 




ctccagctcc 


tgaccgggag 


tgatctgcca 




gacagagtct 


cgctcactca 


gtgctcaatg 


60 


tcggctcgct 


acaacctcca 


cctcccagct 
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gatccaggca 


actacggatc 


caggaaacta 


102420 


agaggagatg 


acaaaatagc 


agaagagcaa 


102480 


ttaaatgatg 


gactgtggaa 


tctaaactgg 


102540 


caataataaa 


gaaaaaagac 


agattgatca 


102600 


gctgtcgtga 


caatcaaatg 


agataatgtt 


102660 


ttgtcccttt 


ttgaaatttt 


aatattatta 


102720 


agagtaggca 


gagaaatcaa 


aatcagggaa 


102780 


tctgggaaag 


cagatgtatt 


atgtcaggca 


102840 


gcctaggtcc 


caactattca 


gcaactgggt 


102900 


gctcaacagc 


tcagcaaaag 


tgaagacaaa 


102960 


atggtgcctc 


aaaatgtggg 


ggaggggatg 


103020 


acaaatataa 


aagtcactta 


ctttagggaa 


103080 


agtaaatttc 


tatcaaatat 


attctatttt 


103140 


gcatatattt 


tagcccattc 


cttgtaccac 


103200 


gacaaccacc 


tgtgaggcag 


gtggatcatt 


103260 


aaaaatcttg 


tcctaagata 


tgtagctagc 


103320 


agtatgtttc 


ccaggtccgg 


gcagttgccc 


103380 


tggatagaat 


cagacacttg 


taatgccaaa 


103440 


tgtaataaat 


ttggaagaca 


ttgccttatt 


103500 


atacagttga 


cacttaaaca 


acatgagttt 


103560 


ttttcaatgt 


gggaaaagtc 


atggaaacgg 


103620 


accaagctat 


actgagggta 


catttgaggg 


103680 


ttctccagct 


gggcctgtga 


tctgagaccc 


103740 


tgccctctcc 


ctctccctcc 


ccctccccct 


103800 


tgccaccatc 


tcgactcact 


gccacctccc 


103860 


tgcctgggat 


tgcaggcatg 


tgccgccacg 


103920 


accgggtttc 


gccgtgttgg 


ccggtctggt 


103980 


gcctcggcct 


cccgaggtgc 


caggattgta 


104040 


ctgcccaggc 


tggtgtgcag 


tggcgtgatc 


104100 


gcctgccttg 


gcctcccaaa 


gtgctgagat 


104160 
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ggcagcctct gcccggctgc caccccgtct aggaagtgag gagtgtctct gcctggccgc 104220 

ccatcatctg ggatgtgagg agcccctctg cccggctgcc cagtctggga agtgaggagc 104280 

acctcttccc ggccaccatc ccgtctagga agtgaggagc atctctgccc ggccgcccat 104340 

cgtctgggat gtggggagca cctctgcccc gccgccccgt ctgagatgtg aagagcgcct 104400 

ctgcccggct gcgaccccgt ctgggaactg aggagtgtct ctgccccacc gccaccccat 104460 

ctgggaggtg aggagcgtct ctgaccgccg ccccgtctga gaagtgagga gcccctcggc 104 52 0 

ccagcagcca ccccgtctga gaagtgagga gcgtctccgc nnnnnnnnnn nnnnnnnnnn 104 58 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnaaaaa aaaaaaaaaa 104 640 
aaaa 104644 

<210> 80 

<211> 95109 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> genomic_DNA 

<222> (1) . . (95109) 

<223> n is an undetermined nucleotide (dATP , dCTP, dGTP, or dTTP) 
<220> 

<221> initial_coding_region 

<222> (60039) . . (60257) 



<220> 

<221> coding_region 
<222> (62556) . . (62678) 



<220> 

<221> coding_region 
<222> (66166) . . (66225) 
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<220> 

5 <221> coding_region 

<222> (67828) . . (67959) 

10 

<220> 

<221> coding_region 

15 <222> (74200) (74313) 



20 



25 



<220> 

<221> coding_region 
<222> (74489) . . (74590) 





<400> 80 
















Illiy l_ l_ LLaaa 






CL ^ ^ 0 ^ ^— i— \_- J 


y \^ v-» v-/ ^ a. ca. c*. 




60 


30 


tacaggtgtg 


agccaccaca 


cctggcccct 


cataaacttt 


ttgtaaagcg 


tcaatagttt 


120 




atgaattcat 


tagtcttcca 


cccgggcttt 


gacatcaatt 


ttatgcttgt 


tctttcttca 


180 


35 


attttagcag 


aattcatgtt 


tctctggtag 


gggctctttt 


caaactgact 


tcttatcctt 


240 




ctaagtgcct 


caaactaaat 


cctgttaaga 


catgttataa 


caattttgtg 


tgagtttatt 


300 




ttggtgcaaa 


aattttttga 


aatccgtgcg 


tagtttttcc 


atagtatgea 


tttcccacaa 


360 


40 


actttttgaa 


gaccccttgt 


attaggttga 


accatttgaa 


attgetgata 


tcagaccatt 


420 




tttaatctac 


aaaattgtca 


gtattgtatg 


atccaatcca 


atgtattaga 


ctattccaat 


480 


45 


atatgcaatt 


tggaatatga 


tccttgtgaa 


atcaagagga 


aaataataac 


atagatatat 


540 




aggttttaga 


atactatgaa 


ttaccaatat 


tgacacagtt 


tttgcactaa 


gtttcctgat 


600 




aggcaaagtg 


aaaatgaaat 


tcaatcatta 


tgtgaataaa 


ttatctatta 


ggcagaaatg 


660 


50 


tttcagttca 


ttatggataa 


aatgattcct 


gtaatagtac 


tttataacca 


tttattcgtg 


720 




tatagttcag 


tataataagc 


agtatataae 


agtatataae 


agttatatat 


atactgttat 


780 


55 


atactttccc 


ccgttctata 


attaaaaatt 


agaatacatg 


gcggggcacg 


gtggctcacg 


840 




cctgtaatcc 


caacactttg 


ggaggctgag 


gcaagtggat 


cacgaggtca 


ggagattgag 


900 




accatcctgg 


ctaacacggt 


gaaaccccgt 


ctccactaaa 


aatacaaaaa 


attagceggg 


960 


60 


cgtggtggca 


ggcgcccgta 


gtcccagcta 


ctegggaggt 


tgaggcagga 


gaatggcgtg 


1020 
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aacccaggag gcggagcttg cagtgagctg 
acagagtgag actccgtctc aaaaaaaaaa 
5 tttctgtttg gagatactga gcattctcag 

tatttcaccc aagaaaactt ccaggagtgg 
caagatggat gcaaggagga agcataggtt 

10 

agaaatcatt atttcattac taagagtaaa 
ctagtcttct ctagatcaat aatagttcta 
15 gagtaatttc tcctgaatga acacttgaaa 

tacttcaatt atccatattc ttataaagtt 
aacacgaaat gttttaacaa cagctcagcc 

20 

aaattatagc attctttatg cctaatgtcc 
ttcacaatat gaattttatg aaatatttta 
25 tctaccctat ttttatttcc tttatctcta 

ggaaccattc ctttgtattt aatataaatg 
tactttatat gattaataca taaaaatgta 

30 

ttggggatag gtatacacct atgaaactat 
cacatcccga aggttcttcc cacacccttt 
35 tattaacatt taacacaccg ctatcccctt 

tcaccacagg cactatgcta tagagtagat 
aatgttgtga cttttaacca tcatcttccc 

40 

actattctac tctctgcttt taagagtttg 
tacagcagat gctggtgagg ttgaggagaa 
45 tgtaaattag ttcagccact gtgcaaagca 

gaactaccat ttaactcagc aatcccatta 
ttccaccaaa aagatatatg cattcacaag 

50 

gacgtgaaat caatccaagt gcccattaat 
acaccatgga atactacaca ggcataaaaa 
55 ggatgcagct ggagaccatt atcctaggca 

gcgtgttctc acttgtaagt gggagttaaa 
acaatagaca ctaagactat caaaaggggg 

60 

atagtgggta caatgctcac tacctgggtg 
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agatcgggcc 


actgcactcc 


agcctgggcg 


1080 


aaaaaataga 


atacgtaact 


ctggttacat 


1140 


tctggatctg 


gttgtcttga 


gttaaggaat 


1200 


tgacagattc 


tccaaagtag 


acataataag 


1260 


gatagcgtaa 


tattttaaat 


gctgactcaa 


1320 


attagcgtga 


agtataagca 


gccccatagc 


1380 


aacatactgt 


actctaacta 


cattatttct 


1440 


tttgcataac 


ctgccattag 


agagaaggca 


1500 


gagactgcat 


aaactaatgc 


ccactaggtt 


1560 


ttcagttttc 


tagaaggtcc 


agtttccaaa 


1620 


tagctacaaa 


taatgatact 


gcatagtttt 


1680 


ctgagattat 


ctcattagat 


ctgttaaatc 


1740 


cactctactc 


tcaaatatct 


cccttcatga 


1800 


gttttttaaa 


accacttttt 


aattttaagc 


1860 


tacatatgta 


atgcatagat 


ctgagtgagt 


1920 


caccaccatc 


aaggccataa 


acacatttat 


1980 


attattattg 


ctttttactt 


tttctttggg 


2040 


agcaatttta 


agcacacaat 


acagttttat 


2100 


ctccagagct 


tagttatttt 


gcataactaa 


2160 


atttatccct 


cccacaagcc 


actgacagct 


2220 


gcgattttag 


gctattatct 


aaaagaaaaa 


2280 


aagggaacac 


ttctacactg 


ttggtgagaa 


2340 


gtttggagat 


tgctcaaaga 


actttaaaca 


2400 


ctgagtatat 


atccaaaagg 


aaataaatca 


2460 


ttccatcaca 


gcactattca 


caatagcaaa 


2520 


ggtggattgc 


ctaaagaaaa 


tgtaatatat 


2580 


agggtgaaat 


catgtccttt 


gcagcaacat 


2640 


act L- Lay u ct*w ci 


y a ci ca. v_.ca.vj ciaci 


cl \ — ' a w*. a i— • 


2 7 00 


cattgtgcac 


acatggacat 


aaagattgca 


2760 


aggaagaaag 


ggggaaaggg 


ttgaaaaagc 


2820 


atggaatcaa 


tcatacccca 


aacctcagca 


2880 
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tcacccaata tacccatgtt tcaaacctgc 
gaaattataa aaaataaaat tatgtaactt 

5 

taaataaaat ggaaaaaaat ttaagagttt 
gctatttgtc tttctgtgtc tggcttattt 
10 ctcacaaatg gaaggatttc tttcttttct 

cacacacaca atttattcac tcatccattg 
ttacgaataa tgctgcaatg aaaatggaag 

15 

ccaaaagtga gattgtttga tcatatgtca 
atattgtttt caataacagc tatagaattt 
20 ctattcacca catcctcacc agcacttgtt 

acaaatgtaa gatgctatct cactatggtt 
gttgagcaca tttgcatata cttcttggcc 

25 

ttgagaaagt catggaataa acttaagtgc 
tggtgtgtat atacatattg tggaatacta 
30 ttttgcagca acatgaatga aactgcggcc 

aaagtcaaaa accacacatt ctcatttata 
catacagagt agaataatac aaattggaaa 

35 

gaggaatgaa atactaccta ttgggtacag 
aagccgaggt ccagacttca ccacttacaa 
40 cccataaatc tattttttta aattgcccat 

tctgttgtat gtgtcctttg tatattttgg 
gcaaatattt tctctcattc cataggttgc 

45 

ctttttagtt tgatgtagtc ccacttgttt 
tcatatccaa aagacatcat tgccaagata 
50 ttctaggagt cttagggttt cagattttaa 

ttctgtgttt gatataaaat aaggctccaa 
tactaacaac atttattgaa gagactaccc 

55 

ctgaaaatta gctggccatc tatgcatggg 
tagtctatat atctgttttt atgtcagtac 
60 tatagtttga aatcaggaag tgtgatgcct 
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atgtgtaccc 


ctgaatctaa 


aataaaattt 


2940 


aaaaaaatat 


tgctgagatg 


gataaataca 


3000 


agattccaca 


tagtgagatc 


atacagtaca 


3060 


cacttaatat 


cctccagatc 


cattcatgtt 


3120 


aaggccagat 


tacacacaca 


cacacacaca 


3180 


gtaacattta 


gcttgttcca 


tatctgacta 


3240 


tgccaaattt 


caaatccttt 


ggctatatat 


3300 


gttctatttt 


taacttttct 


agggacctcc 


3360 


acattctcac 


caacagtgtg 


taagtgttcc 


3420 


atcttttgat 


ttttaataat 


agccatccca 


3480 


ttgatttaca 


tttccctgat 


tattagtgat 


3540 


atttctatga 


cctcttttaa 


gaaatgccta 


3600 


ccatcaataa 


atgattggat 


ttttaaaatg 


3660 


ctttgctgtg 


gaaaagaatt 


aaatcatgtc 


3720 


attatcctaa 


atgaaatgac 


tcagaaacag 


3780 


agtgggagca 


aaaaagttgt 


tacacacgga 


3840 


ctccaaaagg 


tgagagtgtg 


gaagggagtt 


3900 


tgcacagtat 


ttgggtgatg 


agtacactaa 


3960 


tatgttcatg 


caacagaact 


gcacttgtac 


4020 


ttttaaattt 


ggttgcttta 


ttttttttat 


4080 


atattaaatc 


ccttatcaga 


tatatggttt 


4140 


tttttcattg 


tttttcttac 


tgtgtggaag 


4200 


atttttgctt 


tgttgcctat 


gctttacagg 


4260 


aatgttaagg 


atctttctct 


ctgtgttttc 


4320 


aataaattac 


tttattcatt 


tttacttaat 


4380 


ttcatccttt 


tccatgttaa 


tatccagttt 


4440 


tttccccatt 


atgtagtctt 


tctgcctttg 


4500 


tttatttctg 


ttctctctgt 


tctgttccat 


4560 


cactctgctt 


tctttactat 


agctttgtaa 


4620 


ccagctttgt 


tcttgctcaa 


gattgctatg 


4680 
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gctattcaga 


gtcttttgtg 


agtccataca 


aattttaaga 


ttgttttttc 


tatttctgtg 


4740 


aagtatactg 


cttttgatag 


aatagacatt 


ttaacaatat 


tagttcttcc 


aatctatgaa 


4800 


catgggatat 


ctttccattt 


atttgtgtct 


actttaattt 


ctttcatcaa 


cattttatag 


4860 


ttttcagtgt 


acaactttgg 


catttccttg 


gttaaatgta 


tgccagtgtt 


ttcacaattt 


4920 


ttctctgtta 


aacatcattt 


tgtgtgtgtt 


tattgttttc 


ctggttttgt 


taaattgtct 


4980 


ctatgttctc 


ttgcagctca 


ctgatgttct 


ttaagatgat 


tattttgaat 


tctttgtcat 


5040 


gcagttcata 


gatatccaat 


tttaaaggtc 


aaccacaaga 


gcttcacttt 


gttcttttgg 


5100 


tggtgtccta 


tttccttgat 


actttgtgtt 


cctcaaagct 


ttgcattact 


gtcttcacat 


5160 


ttgaagaagc 


agtcaatgtt 


tctaatcttt 


actggttttc 


ttcagtagag 


aaagaccttc 


5220 


acagtgagcc 


cagccagaga 


ttcttggggc 


ctcatgtact 


ttttctataa 


tgtgcacact 


5280 


ctatttctct 


tgttccttct 


tagtgggcaa 


gtctcaggat 


tgtgtgcctt 


ctttcaatcc 


5340 


aaagtcaggc 


ctgatgctga 


gagcctgcca 


tttattttcc 


ctaaggcaat 


gtcctgaagt 


5400 


tttaaaggtt 


gtgcatcttt 


tttttttttt 


tttttttttt 


ttttttttga 


gatagggtct 


5460 


cactctgttg 


cccaggctgg 


agtgcagtgg 


cacaatctcg 


gctcactgca 


acctctgcct 


5520 


cccaggttca 


agagattctc 


ctgcctcagc 


ctcccaagta 


gctagggcta 


caggggcatg 


5580 


ccaccatgcc 


cagctaattt 


ttttgtattt 


ttagtagaga 


tagggtttca 


ccatgttagc 


5640 


caggatggtc 


tcggtctcct 


gacctcaggt 


gatccgccct 


cctgggcctc 


ccaaagtgct 


5700 


gggattacag 


gcgtgagcca 


ccacgccctg 


cctcatcttc 


tttcaaagca 


gcagtgtcaa 


5760 


gccagttggt 


aagatccaca 


gctgctgttg 


agatccaagt 


gctgtctgtg 


ggagctcatt 


5820 


ctgagctgtg 


catgggcatg 


tgtggggtac 


tattcaaggt 


ggggttccca 


gggtgctagg 


5880 


agaatgtgtt 


ggctatttgg 


gaagccctct 


gacaagtcat 


tctgcagggt 


ttatgggcag 


5940 


gcctcttggc 


ggagttcaag 


agctggttat 


cccagtggaa 


tatgtttgta 


attgcagtgg 


6000 


aatgtgtttg 


taaaacatct 


ttggttttca 


aagttctgca 


gcatgagcat 


atgaggtacc 


6060 


tgagcaggta 


tcatctccac 


tctacaaaag 


aataaactga 


ggaccaaagg 


atttaagtgc 


6120 


agtgctgtac 


ccaccacatc 


ttgcagagga 


ttctaagtgg 


caaggagctt 


ctgacaatga 


6180 


tgctgcttct 


aagcgggggt 


gtgtctgggg 


ttagcagtgg 


ggccaccacg 


tgatcaatcc 


6240 


gccgacagat 


caagcacagc 


ctcttcctcc 


tgggtgcgcc 


aggcccaagg 


acagaggatg 


6300 




o.y \»- v»c*yy c*. c*. a 
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cctcatctta 


63 60 

W *J W \J 


ggagcctgca 


ggctccgtgg 


tgccacggga 


gccccaccgc 


acctcacgta 


tccgcgcgag 


6420 


tctctagcac 


agccccgctc 


ttgggcgtta 


gaaccccgcg 


aggggggatg 


aaggacgcca 


6480 


ccgaggaagc 


agaccccacc 


gggctcccga 


ggctgccagg 


ccccccgctc 


a gggcgctgc 


6540 
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ctgcagaacc acgcccccct gtccctgtgc 
ggcctggtta ccattcgcga ctccctcgtc 

5 

ctagtaacca tggtatagtt gagacgatcc 
aggcgtctag gaggcttacc tggcattgcc 
10 aaacaccagc ggggctgttg gccttgtgag 

gttctcagca tctgcccaat ttgagctcct 
ttcctaatgg ttcaggtggc cataagatat 

15 

aagatctctg tttatcacct tgagatgttc 
tcacatagcc tagattttaa gtcgttgcat 
20 agcttgctct ttaggacagt agtaataaaa 

gttacatact agacactttt ctaaattatc 
ctcctatact aagacctcaa atagcatgaa 

25 

actgcaggaa cacatagaaa cttcgttgta 
ccaaaaaaaa aaaaaaatgt accatgcaca 
30 tatgacaaaa acatacaaaa aaatgactta 

tggagtatgg tttgagaaat tagcctggaa 
cttgtatgtt tgcagactat tctaaagtca 

35 

tatgaagaat tgctgtggct gagatgtggg 
atgaatatta gaggacagat gaccgaataa 
40 tgagctcctt gcgaactgag ccagtgatct 

cctcaatata gtaacagatt tggatttgga 
tggttcagct gggttggact ggctttgggg 

45 

gcagtgaggt attcaagttt aacaatactt 
cactgaggaa tagaagttta aattaggaat 
50 gctcatggag aaggttagag gagagttcca 

tcattcctaa gatataagac ttatgtcagt 
tgacatcagt gaaaatggca cagtggggaa 

55 

ttcagaagca acattctcac aactctgaaa 
gcttgatcaa gaaaacacaa ggaaatgttg 
60 cctagtctca tttcccactc cccagctcgt 
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taagagggtg 


acgcgagtgc 


ccgaagccct 


6600 


ctgggttctc 


ccattgccag 


gaatgtttta 


6660 


actcttgtcc 


tacgcatttc 


ggcacaatac 


6720 


tgagagaatc 


ttccatccat 


ttaagtaagt 


6780 


ttgtggcacc 


gtgaaaaacc 


aaggcctgca 


6840 


gtttgaaacc 


actgttgggt 


ttcccttctc 


6900 


cctcttaagc 


ttgagtgtca 


gttattttat 


6960 


tgcagggtcc 


ctgaagtgaa 


tttcgtgtat 


7020 


tccagactta 


ccattcaaat 


ttggagtttc 


7080 


ctacaatcat 


gactacattt 


attgcacaat 


7140 


tattaatact 


aattttatga 


atgccatgag 


7200 


atgaaatcca 


gtcagtttag 


ctgtttgcaa 


7260 


cttatacatt 


tatttgacag 


tcactgagct 


7320 


gtgcttagaa 


tctacagtta 


caaagatgaa 


7380 


ttggtggaac 


tgaaactgac 


cagaaatggc 


7440 


aggaagctgg 


agggcacatc 


gtgaaagcct 


7500 


gtagggagcc 


tctaaaatgg 


tatgatgata 


7560 


gaataggctg 


aaggagcaat 


gaaatgaata 


7620 


gaaatcttca 


taatagtcca 


agcaaaaaaa 


7680 


ttgtctacca 


tttctatctc 


tacacccacc 


7740 


gttttccatg 


ttttaatggg 


ttggttgatt 


7800 


gaagactaaa 


gggg^ggaag 


ttggaaggga 


7860 


cagttggatt 


tctgggcacc 


attttaattc 


7920 


gggacaaaag 


ccaagtgcag 


gatatggaaa 


7980 


atgcagtgag 


catgtgggaa 


cccaaaagac 


8040 


gaaagtaatc 


taaaacgaca 


tggttagagg 


8100 


caccaagcaa 


ttaataagct 


ggaaaaaact 


8160 


attaacgaaa 


acttgcagca 


accaagggat 


8220 


gaaacagatt 


tttgtgacat 


tttaatttat 


8280 


cagtgccctt 


gaataaaaca 


gccttcattc 


8340 
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tctgtatagg 


tacccagtaa 


cagagggagc 


aaagcaaatc 


ttatcctcaa 


agaactgcag 


8400 


ctgcttgtta 


gatctacctg 


gtggttccat 


agagaaactg 


ctcagagaac 


ctgcctttac 


8460 


ctcgcctaaa 


acagaactat 


cccggagctc 


aggtaagcag 


tggcatttgt 


caaaaacatg 


8520 


taaaggaaaa 


aacataatag 


tcattgattt 


ctgaggcaag 


gaataatggt 


tgggacaaat 


8580 


aatagactaa 


ctgaaaagtt 


tgggagaaaa 


ggccagggaa 


tgagatgctt 


tgagtaacaa 


8640 


actcccacat 


atttctgata 


atctagaagg 


ccacagacat 


gccctgggct 


agacacatgc 


8700 


tcagaaaaga 


cctgaaaaca 


tcctaaggtc 


tcaccttcgg 


ataatgttca 


gactctgcac 


8760 


aagtaggaag 


taaaggataa 


gcagagtggt 


ctttaatgca 


ttttgagttg 


atttttgtat 


8820 


atggtgtgag 


aaaagggtcc 


aatttcatta 


ttctgcatgt 


ggatacccag 


ttgtctcaac 


8880 


accatttatt 


gaagaggctg 


tcctttctcc 


attgtgtgct 


tttggcatct 


ttgccaaaaa 


8940 


tctgttgacc 


ataaatgtgc 


aggtttattt 


gtgagttctt 


cagtctgttc 


cattggtcac 


9000 


tgagtctgtt 


ttcatggcaa 


taccatgttg 


ttttgattat 


tatagctttg 


taatatattt 


9060 


caagattata 


tagtgtgatg 


acttcagctt 


tgttcatttt 


gctgaagatt 


gctttggcta 


9120 


gttggggtct 


tttgtgattc 


cacacaaatt 


tcagattttt 


tctctatttc 


tgtgagaaat 


9180 


agattgaatt 


ttgatagaga 


ttgcattgaa 


tccatagact 


gatttgggta 


gtatggatat 


9240 


ttcaacaata 


ttaactcatc 


caatccatga 


acatgaaata 


gcttttcatt 


tatttgtgtt 


9300 


ttcttccatt 


ccttttatca 


atggtttgta 


gttttcagta 


tataaatcat 


ttaactcctt 


9360 


ggttaaattt 


gtttctacat 


attttttata 


ctactgtaac 


tgggtttgtt 


tccttaattt 


9420 


attttttgga 


tagttcattg 


ttagtacaaa 


gaaaaccttc 


tgattcttgc 


atgttgattt 


9480 


tttctttttc 


tttttttctt 


tctttttttt 


tttttttttt 


tttgagatgg 


agtcttgttc 


9540 


tgtcacccca 


ggctggaatg 


cagtggcatg 


atcttagctc 


actgcaacct 


ccacctccca 


9600 


ggttcaagtg 


attcttatgc 


ctcagcctcc 


tgagtagctg 


tgattacaga 


tgtgtgccac 


9660 


catgcccagc 


taatttttgt 


atttttagta 


gagacaaggt 


ttcaccatgt 


tggccaggct 


9720 


ggtctcgaac 


tcctgacctc 


aagcgatcca 


cctgccttgg 


cctcccagag 


tgctgggatt 


9780 


atagacatga 


gccaccacat 


ccagccgcat 


gttggttttt 


aaccttgaaa 


ccttactgaa 


9840 


tttgcttatt 


agttctaata 


gttttttggt 


ggaatctatg 


ggattttcta 


tatataggat 


9900 


catgtcatca 


tgtctatggg 


gttttctata 


cataggattc 


atttcctatc 


tggatgccaa 


9960 


tatatccaaa 


aaatcfaaaca 


taaaatcaaa 


caacrat t at t 


ctcrtaaactt 


cracrafc t taac 


10020 


attctttacc 


cagttgagtt 


tttctttgtt 


gctatttctc 


ccttttagaa 


tgaaaatatc 


10080 


tatcctatgc 


ctgtcacact 


attgtgtttt 


ggaagcacaa 


aacttattcg 


atttcacagg 


10140 


cccacagctg 


gagggaaact 


tatctcagga 


tgaatcatgc 


cttgaatctc 


atgcctatat 


10200 
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ctgatttaga tgatactctt gactttggac 
tttgggtctc tagggataga atgactgcat 

5 

ctatgggata aatgctatgg tttgaaggtt 
ttgccgtttt atcaatatta agaggcaaga 
10 tgctctcatg aatggattaa tgccttatca 

ttcattataa aaggaagagt ttggtcctct 
cctttccacc atgggatgac acaacagtaa 

15 

tagacttccc agcttccaaa actttgagcc 
acagcacaaa atggactaat acaacttgag 
20 atgttccatt agtacttgag aagaatgtgt 

atttgtatat taggtctatt aggtctacag 
ttttctgtct ggatgatctg tccattgctg 

25 

ttgaattgca gtttattgct cccttcatat 
tatggtgctc caatgtcggg tgcatatata 
30 gaccccttta tcactatata attactttgt 

tcttgacaca tataagatag ctacccatgc 
tccatccctt cacttttagt ctaagagtgt 

35 

tccatagagt tgggtcttat ttgtttattc 
tttaatctat ttacatttaa ggtaatgttt 
40 gtatttgttt actgactgtt ttgtagttcc 

gtgatttgtt gaatttttgt agtgatatgc 
gctataggct ttctttgttt tgttttgggt 

45 

tagggtacat gtgcacaacg tgcaagtttg 
tgctgcaccg atcaactcat catttacatt 
50 agccccctga caggccctgg tgtgtgatgt 

ttcaactctc acctatgagt gagaacatga 
tgctgagaat gatggtttcc agcttcatcc 

55 

tttttatggc tgcatagtat tccacggtgt 
tcattgatgg acatttgggt tggttccaag 
60 acatacgtgt gcatgtgtct ttacagtagc 



ttttgagctg 


ttgctggaac 


aagttaagac 


10260 


tttgtatatt 


aggacatatg 


ttttggggga 


10320 


tgtcccctcc 


caaatcgtgt 


tgaaatttag 


10380 


cctttaagag 


tgattcatcc 


atgatgtctc 


10440 


ctggagtggc 


tttgttattg 


agggaatcgg 


10500 


tttgcctctc 


tatctctcac 


catctctttg 


10560 


gacccttgcc 


agatgccaac 


tcctcaatct 


10620 


aataaattac 


acagtctgtg 


gtattctatt 


10680 


aggagcctaa 


tatatgatgt 


atcctggaga 


10740 


attctgctac 


tattagaaga 


aatgttcttt 


10800 


tgtagttcaa 


gtccaggatt 


tccttaatga 


10860 


aaagtgggat 


attgaagttg 


cctactatta 


10920 


ctattaatat 


tatatatatg 


cacacattta 


10980 


tttacaattg 


ttatgtcctt 


ttgatgaatt 


11040 


ctctttttac 


gattttttac 


ttaaagtcta 


11100 


actggtctcc 


ctttgtgtgg 


aatatctttt 


11160 


ctttaaaggt 


gaagtgagtc 


tcttataggt 


11220 


attcaaccac 


tctgtctttt 


gattggagaa 


11280 


gacaagtgaa 


gagttagtat 


tgccattttg 


11340 


tttgttcttc 


caatcttgct 


ggtttccttt 


11400 


tttgattttt 


tttttatctt 


tttcacatct 


11460 


tttttttggt 


ttttctaata 


ctttaagttc 


11520 


ttacataggt 


acacatgtgc 


catgttggtt 


11580 


aggtatttct 


cctaatgcta 


ttcctccccc 


11640 


tccccaccct 


gtgtccacgt 


gttctagttg 


11700 


ggtgtttagt 


tttctgtcct 


tgtgatagtt 


11760 


atgtcctggc 


aaaggacatg 


aacttatcct 


11820 


atatgtgcca 


cattttctta 


atccagtcta 


11880 


tctctgccat 


tgtgaatagt 


gccacaataa 


11940 


atgatttata 


atcctttggg 


tgtataccag 


12000 
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tagtgggatt gctgggtcaa 
tgtcttccat aatggttgaa 
5 ttgtccacat cttctccagc 

ctggtgtgag atggtacctc 
atgagcattt tttcatgtgt 

10 

tcatatcctt tgcccacttt 
tttgtagaat ctggatatta 
15 attctgtagg ttgctgctat 

tcttgtagat atacctattt 
ctacactttt acaccccata 

20 

aaagacgaaa ataatgggag 
taaggtcatt acagaagaat 
25 ttgctgttta gcagagaata 

gctaagcagt ggaaaatcag 
gtattacttg ttctctatat 

30 

tatacaatat ctctgttaaa 
tgggggtaag gactaagtgc 
35 agaaaaagga aaagacattg 

cctaaggtga tgataactag 
acattccgga atagttcaac 

40 

tgtttgctga tatatgagag 
gctcatcaat cataaagtgg 
45 ggggcactgt actgtcaaca 

tcaatgatct cccgctcagc 
aatatttgtg ggaccttgaa 

50 

aggaaaattc tcaacgtatc 
acactatcag aatgcagggc 
55 taacgatgtt agctagggtg 

acaatgaata gaggcaaaaa 
ttagtactgt catgccccat 

60 

aagcctatga atggcccaga 
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atggtatttc 


tagttctaga 


tccctgagga 


atcgccacac 


12060 


ctagtttaca 


gtcccaccaa 


cagtgtaaaa 


gtggtcctat 


12120 


acctgttcgt 


ttcctgactt 


ttttatgaac 


accattctaa 


12180 


attgtggttt 


tgattcgcat 


ttctctgatg 


aacagtgatg 


12240 


ctgttggctg 


cataaatgtc 


ttcttttgag 


aagtgtctgt 


12300 


ttgatggggt 


tttttttttc 


ttgtaaattg 


gtttaagttc 


12360 


agccctttgt 


cagatgggta 


gattgcaaag 


attttctccc 


12420 


aggctttctt 


ttgtatttac 


catgaggctt 


aaataaaaaa 


12480 


taagctgatg 


acaacccaac 


ctcacttgca 


tacaaaacct 


12540 


ccacatttta 


catttttctg 


ttacagttta 


cttctctagg 


12600 


tgggtcagct 


acatcaaaag 


gtatctactc 


agtctcattt 


12660 


aaaatctaag 


aggtagcttt 


tggcttttgc 


caggaagtta 


12720 


gtgtcagtgg 


gttgatgcaa 


aaagcttttt 


gcaaggaatt 


12780 


aggcttccat 


ttcaagtgcc 


ttatctcaaa 


ataatcatta 


12840 


ttctctatag 


caaaggagtc 


cattcatcct 


ctataactgc 


12900 


atgctgagaa 


gatttatcct 


aaaagaaggc 


accaaagcaa 


12960 


tacttgggtt 


aataaatgac 


tagtgttgct 


tcttcctaat 


13020 


accacatctt 


ccccaaagga 


aggagaattg 


acaaaagtgt 


13080 


cctaaagcta 


attggactca 


aatgtgacgg 


acccaaaaaa 


13140 


gttaccacaa 


tccacccagg 


gttctaacaa 


atacattctg 


13200 


gtccattttc 


tgcctgtgtc 


tgtaaacagt 


ttgaatcaaa 


13260 


tcaggtaatg 


ggcttaaaac 


cagaaaaggc 


tgtagaccct 


13320 


tgtgcccctt 


acattcagct 


caggtgggtc 


acattctgct 


13380 


tctcctcaat 


cttagaatac 


tatacccttt 


tgtattatcc 


13440 


tcacatttgc 


atttaagtca 


gcaacattaa 


accttgcact 


13500 


aaatatatta 


ttatctccag 


ctggttgaga 


cctacccagt 


13560 


atacctacat 


gcctctgagt 


caacaggcaa 


aaatgagagt 


13620 


atcaatrraa 

S* V-' C*. C*. O Vw *^ CT-*H 


CSS t" csccsscscs 


aaaaattaaa 


ccactactt"c 

V— ' (A \m* W l_» W 


13 680 
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tgaatatgtc 


tggaatacag 


gagatccttt 


aagatgtctc 


13740 


gattaaggtc 


aatgggaaat 


tacaacaacc 


caatccaggc 


13800 


ctcttcagga 


ttaaaggttt 


gggttatcct 


gtcaggtaaa 


13860 
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gaaacatgac 


cagccaaggt 


tgcataatac 


5 


gcactgtagc 


ccctttctga 


ggcattcctg 


ggacagaact 


tcaagcagta 


cacgaggttg 










10 


cttgaaacga 


atatgattag 


aaaagtggtg 




tagacttctg 


aataggcaaa 


acacataaag 


15 


agagcatgat 


tctcctgcct 


cagcctccca 




tgcccagcta 


atttttgtat 


ttttagtaga 




1~r , 1~ fn^arfr 1 

L- L. L U- y O.O.U 


*w *w ^-j d. l_ L~ CI ^— j 


v-4 Q. l_ \— » i. v — ' 


20 


aggcatgagc 


caccatgccc 


agccagcaga 




ggcccattct 


gtggatacca 


ctcagcctct 


25 


gcccattaac 


aaagtggcca 


tggtggcagt 




ccacatgggc 


ttccactcac 


caaggctgac 




LMVyL. d ^—j \_» CI w 


ciy d- ci. i_ a. d. ' 


a 1 1 a era ttcc 


30 


cctggtggca 


gaatggctac 


attggaccat 




ttactggaat 


agatacttat 


tctagatatg 


35 


caaaaactac 


catccatgga 


cttacagaat 




agcattgttt 


ttgatcagcg 


aactcacttc 




ar , t~f~ , a , t"crt~aa 


tttactaatc 


ttactatatt 


40 


ggatgatgga 


atgaactttt 


gaagacaatt 




taactggggg 


aaggttcact 


agcagagtgt 


45 


aaatcaggga 


gtgaaaatgg 


gagtgacacc 




aaaaatttgc 


ttcctgtcct 


gaaacattat 




acraaciaaa1~ci 


cttctaccaa 


aacracataat 


50 


cccaactagc 


cattttggac 


tttttatctc 




tgctggaagg 


ggttatcgat 


actgactacc 


55 


taaggaagtg 


tatattagta 


atacaggaag 




cctgtgatta 


aggtcaatgg 


aaaactacaa 




ccagactctt 


caggagtgat 


ggtttgggtc 


60 


gaggtgcttg 


ctgaaagcaa 


aggaaatatg 



ccaggcacaa 


ccccaaagta 


gacagctgta 


1392 0 


aagaaaataa 


tgaagaaaaa 


tcttcccagt 


13980 


tatattttgc 


ttggaaggag 


aaatagcegg 


14040 


tggccaatgg 


tgtggctgga 


atgtcaggga 


14100 


acataggagt 


ctagagaaca 

< 


ggtttgtgga 


14160 


atatttgtgt 


cccatgttaa 


tgtccaccaa 


14220 


agtagctggg 


attacaggtg 


cacaccacta 


14280 


gatggggttt 


caccatgttg 


gecaggctag 


14340 


cgcctcggcc 


tcccaaagtg 


ctgggattac 


14400 


ggaagatctt 


aataatcaag 


tagataggat 


14460 


ttcctcagcc 


accccatcat 


tgtccaatgg 


14520 


ggcagggatg 


gaggttatgc 


atgagctcag 


14580 


ttggctacag 


tcactgatag 


gtgcccaaaa 


14640 


tgatatggta 


ctattccctg 


gggtgattag 


14700 


ttctgtcatg 


gagagggcag 


tgttttcttc 


14760 


gttttacctt 


ccctgcattc 


aatgcctctg 


14820 


gecatatcta 


ttgccatggt 


aatgccacac 


14880 


agagcaaatg 


aagtgcaaca 


ataagctcaa 


14940 


ccccaccatt 


atgaagcagc 


tggtttgaca 


15000 


acagcaccag 


ctagctggca 


atatcctgea 


15060 


ttctcccaca 


gctaggattc 


acaggtccag 


15120 


actcactatt 


acccatagtt 


acctactagc 


15180 


actgtgcttg 


tcagatgtct 


ttattccaat 


15240 


aatgatttca 


ttgaactgga 


agttaggact 


15300 


tgaatcagtt 


ggcaaagaag 


ggagttattg 


15360 


ataaggaaat 


ttggcctatt 


acttcacagt 


15420 


ccccctaggg 


tatcctcagt 


attaccatgt 


15480 


caacccaatt 


agaacaggac 


tactgatggc 


15540 


aacccaccaa 


gtcaataatc 


acagcctgct 


15600 


gaatgggtag 


tgggagaaaa 


taatcataaa 


15660 
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taccagctat aaccatgtgg ccagttacag 
ttccttattt tgttatgaac atgtttgtgt 
5 tccctctctt atccccttat tatgtaacat 

aagttgtggt atttcaacag aagagtaaac 
aaaagtgtta gtgcattttt tgttgtatgc 

10 

ctttggagat taagtatggt ttcacgagat 
gtggacatga agttgacttg tgatgggtaa 
15 atgcctagat agttgttcaa acattattct 

cattaacatt taagtcgctg gtctttgagt 
attttattca atcagttgaa ggtctgagtg 

20 

tccctcatcc ctcccctcat ggagcaagag 
atctgcacca tcagctcttc cgtatctcca 
25 ttgccagcca aaatttacag agtacatgta 

caagtctaaa tcattttaag atgtttgttt 
cccagaacca cagggaaaat tgtgacccat 

30 

gtattcaaag gcaaaagcgc aggtatgagg 
ctagataaga ggcaagcagt tgcatcctct 
35 acaatgtacg catgacaggg gggtacagga 

aaatctgcct tttcacataa gataaaataa 
atttgtctca ggtgagcaga gagatgactg 

40 

ataaggtcct aagatttatt ttccttttac 
tatataaaat atatagggtc tgtttctctg 
45 tctcaatacc agaacagaaa cccatatatt 

attgaagtag ctttaattca cttaaaacta 
ccaggggata aagtagttgc agggcagatg 

50 

ttcggtggtt gcacacatcc tagtcactgg 
aataaagccc agctccaagc aggccccaga 
55 caggagtcac tagagggcct gcttgcctgc 

agattggtta caaaggtcag cactggttgc 
tccaagtcac aaacaagaga ggatgaggga 

60 

gattcacagt ggagaacaag agctgagtct 
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aaataaggac 


tgtaattgtc 


atgagtattt 


15720 


gtttgtgtgt 


attacatacc 


tttgttttct 


15780 


aagatgtgtt 


gactttatgt 


tatggctttt 


15840 


gtcactcaag 


aattttgcat 


cctcttctgg 


15900 


atgacagctg 


ttaggcaaaa 


gtacatcttt 


15960 


gtgtaagtgg 


actgctaagt 


ggactgctaa 


16020 


ttttatgtgt 


caacttatct 


ctgtcaaaat 


16080 


ggctgttcct 


gtgagggtgt 


ttgtggatga 


16140 


aaagcagatt 


gcccttcata 


atgtgggtag 


16200 


aaacaaaagg 


ctgaccttcc 


cccctccatt 


16260 


gaattttcca 


gaagactgcc 


ttcaagcttc 


16320 


gcctgctggc 


ccacactata 


aattttggac 


16380 


cagatacaga 


gtaaatcaaa 


acgactgagg 


16440 


gccaaggtta 


aggacatgcc 


tgagaaagaa 


16500 


actttttctg 


aagagggtct 


ggagacctca 


16560 


aaagaggaag 


aaatttttta 


aagggtgtga 


16620 


tgagtgtttc 


atcagccatt 


ccctgaatac 


16680 


atagttactt 


atacatttgt 


ctagtccaat 


16740 


acatagggca 


gaggaagcaa 


tcagatatgc 


16800 


agttctgtcc 


ttcgtcccac 


acctgtgaag 


16860 


taacatatat 


attatacaca 


catatatgta 


16920 


gagaactctg 


aataatatac 


ttttacataa 


16980 


ccctatgaac 


atgaatttta 


aggtaatctg 


17040 


cctaaataga 


tcagaaagaa 


agaatgggca 


17100 


atgttgggtt 


ttaatctcca 


atacggcccc 


17160 


tgtgcgcagg 


tggctctact 


ccggctggaa 


17220 


gaaccacttc 


acataagagc 


tatagaaaac 


17280 


v_ i_- i~* v-j cl a. \~< 
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actataatcc 


1734 0 


aggagcaggc 


agagcagaga 


gaaactgagg 


17400 


agtaaggcaa 


gatcaaaagt 


tcctatttgg 


17460 


acaggctgga 


gctggcctca 


gagaaccaac 


17520 
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catgggacgg 
ttttgttgtg 
ctagaaaaat 
tgtttaaata 
tgctttctct 
acattctcct 
taataatgtt 
tgcactggga 
tcctatccct 
accatggtat 
atgggctaaa 
gaaatgcctt 
tggaaaatat 
ctacaaggct 
tacagaggca 
cacaggagga 
tattataata 
agaagcagaa 
tacccacaac 
taacacattc 
cccagagaag 
tcttatagtg 
agtgtatgta 
aatcatctag 
gccatagata 
ctccaagttc 
ccatttgaca 
cagcaatgaa 
tgacaggatt 
tcatcttttg 



agacaaatat 
tgtcctgggc 
aaatattagt 
catgaggaga 
ctgacacaca 
aacctctcag 
ggctctctct 
actctctgaa 
ataattgctg 
tattgctttc 
gacccaatct 
atg'tttgttg 
attaaaggag 
tatcttttgc 
atttttagag 
atttggggaa 
attccatggt 
agtttaccca 
tttgtacttt 
ccttttgctt 
ggcctggtga 
ggagtcctca 
ataaagacaa 
aaagaatcct 
ataattactc 
attttacctt 
aacagggaaa 
gtgtcagagg 
tcattttcct 
ttattgccgc 



caaggagaac 
caaacaaata 
tgtctctaaa 
ccgcaccacc 
attctttcac 
tcacccagac 
tctcacaggc 
ttcattggat 
tagtaattta 
aattcattta 
ctaacttaaa 
tatattttaa 
gatgcttgaa 
agaatgaaaa 
agcatgtata 
attaaacaga 
agcagttgcc 
aggttaaatt 
tcaaccaaaa 
tgtgaaaagg 
tctctgctct 
agtggaagac 
atcatttaaa 
cagcccaaga 
aattttatac 
aattgccacg 
ctgaggcttg 
taaaaacaaa 
gggctgaagg 
aggcagattg 



cttaaggatc 
ggcctctgct 
ggttgctatg 
taactgtggc 
tactgcttca 
tgtaatcatc 
tgtgcaaata 
tttgctaaga 
ctcatttttg 
tttttttcta 
ggagcccatt 
taattttctt 
gaataataaa 
atccaaaggc 
cagaaaaaaa 
gatttctaaa 
aaaactccca 
tctccataga 
acagggcttc 
cctctgagac 
ggataaaaga 
atttacaaac 
ctgaaataat 
tatgaaagag 
ctattatgta 
acaaccccat 
gagaacttaa 
caaacacaag 
atccccagtc 
aagacaggta 



ctcagagcca 
ttttacttaa 
aaacaaaagc 
tgccatttct 
gaagtaccag 
tgtggtagat 
atgattggcc 
gaggattcaa 
tcattatttt 
ttgaattcag 
tttgtgtgaa 
gagaggttga 
gagagtggct 
cttagcttaa 
aaaaaaagga 
gtagatgttc 
ccaagggcat 
agagagtttt 
aagggagaga 
atccctgtac 
cattattttc 
accgagaact 
acaattaatc 
gagcatatga 
tcaggcatct 
gagatagcta 
gtgactctca 
ctaagatgat 
cccttcttag 
tatgatttac 



acagctctcc 
ctagaaccaa 
aaatactccc 
ctattcagcc 
ggtaaagcaa 
ttgacttgag 
tgattcacag 
gtggttctgg 
ttgtgagtaa 
gtgtctagaa 
tacagactta 
ggtgcctata 
atttgggtaa 
tgtctggggc 
gacctagagg 
tgaataaggg 
gcccaattac 
attacctcct 
agatttagta 
agtcatttca 
tctaaggctt 
ttctggaaag 
acttattgca 
cagatattta 
ccaaaaggca 
tcattcatct 
cagagtgaca 
tctgccttca 
tatgtttgaa 
tggaaaaaaa 



17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
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ttaattcttg gtattccaga aacaacaatc 
cacttagggt atgatatttg tggaaggcaa 
5 aaactattcc cactgatgaa atttttttta 

agtgagagcc accacactag ccactaggaa 
aataaatgac aattaaaaca agtgaagtgc 

10 

gctccctaag tcccatcacc tatgcaaagg 
ttttcatccc tataacagag aaatagtgaa 
15 ctacagactt cgaatggggt ttttcatgtt 

gccactttgg atattaggaa tgccactggc 
cataaggcat tgtgtgttta taaatgctta 

20 

cagtagtgta atacagaagc ctgttacgag 
ctctaaagat taaaatgtta cttggtgcaa 
25 aacatttttg gggaaaaaaa acttttccaa 

ttgtaaagat atcctccagt gcatcctctg 
agcaataata ttgtatacag agacagacgt 

30 

ctatctcaac gagcatatat aaccaaataa 
aggctggtgt gcagtgggtg atcacagctc 
35 atcctccagc ctctgcctcc caaatagctg 

tttttcattt tttgcagaga taagggtctt 
tggcctcaag gcatactcct gccttggtct 

40 

actaaaccta gccaccaaac aaaacttcga 
aaatgtaatt aggataagat tagaaaatta 
45 aataatgtgt gaaaagagaa ttgccttcag 

cacaaaccag atatcatgac aacacatcta 
tccctcctgt ctctcacagc cccccttgtg 

50 

ctgaattttc caattctgac tgccttacta 
actctgtgcc tcactctcct catttaaaaa 
55 caaagggttg ctgtgaggat taaattagta 

cacagggtgg ctcatgccta taatctcagc 
ctgaagtcgc gtgttcgaga ccagcctggt 

60 

atacaaaaat tagctgggtg tgatggcgca 
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ctttcaatta 


gtgccagaag 


gtgatgatag 


19380 


cctataaatc 


ttcagatgta 


atgttttctg 


19440 


taacattatg 


ggagaaacgt 


agttcttgca 


19500 


atattttcta 


ataattcatt 


tcaagggatc 


19560 


ctacaacttg 


tcctcataaa 


atcattgtga 


19620 


tctcattaat 


gaccggagac 


tttaaaacat 


19680 


tccatgcttt 


agaaattgta 


ctagcggtca 


19740 


aaaatcactt 


tcttcagaaa 


tgcattcagt 


19800 


tctagggaaa 


gaagctattg 


aggggattat 


19860 


tttctatccc 


cagttcatca 


actcaggatc 


19920 


atataaggta 


tgtcagaata 


ttcggagcaa 


19980 


acaaacaaaa 


aaagaaatta 


ttgctgggcc 


20040 


atgtttaaat 


gaaaaaaaat 


tatttaattt 


20100 


tgcatgaatg 


ttggaggagg 


atacagtgtg 


20160 


gagagtttcc 


atgaaataaa 


tcatttcaag 


20220 


aacttaagac 


aaggccttac 


tctgtcaccc 


20280 


aatgcaacct 


ggagctcctg 


ggctcaagtg 


20340 


ggactgcagg 


tgtgtgccac 


cacacctggt 


20400 


gctatgtttc 


ccaggctggt 


tttggactcc 


20460 


ccgaaagtgc 


agggattata 


gacatgagcc 


20520 


ccctgagaat 


caaagcattg 


ggaagcgaag 


20580 


gtgtctagtt 


aaggagaatg 


gataagacaa 


20640 


caaaaccagg 


tcaggaaata 


tcaattagaa 


20700 


acgaaggatg 


gtcctttttc 


ttttagaact 


20760 


attaggatca 


cagctctgga 


ggcagatagc 


20820 


gtgcctgtgt 


aaccctgagc 


aaattacttt 


20880 


ataaaatgtg 


gttaataata 


gtaactgcct 


20940 


tgtaagcata 


aagaacagta 


cctqqqqcca 


21000 


cctttgggaa 


gccaaaggca 


ggtggatcag 


21060 


caacatggca 


aagccccgtc 


tctactaaaa 


21120 


cgtctgtaat 


cccagctact 


caggaggctg 


21180 
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aggcacgaga atcgcttgaa cctgggaggc 
tgctctccag cctgagtgac agagcaagtc 

5 

cacaaagata ccactctgct tatcaaaacc 
ttccccaggg gaaaatctgc tttcagggac 
10 atatttcaga taaaggggtc ctcctatttt 

tgagaggcca tttttctcag ccccagattg 
cacttttttt aacttctatt cccaaggtag 

15 

ctttttctaa gctttttttt tttttttttt 
tgtgacttag aagtcacctg gtgttgcttt 
20 taagtacaga tgacatgaaa gcccataaag 

tagcattctc atagtttagt ttctgtattg 
acccaaagaa acagttgtga ataaactcag 

25 

aaaagttttg ggttttttta ctcgaggatt 
agttccagtc acaacctaga tccctttgta 
30 tctgcccttc tttttccatc tccccactcc 

atatgtgtgt gtgtgtgtgt gtgtgtgtgt 
gtcaggcatg ttctatgtat gtaggatgca 

35 

ccttagggag cttacattct aatagcagga 
gggaaataat ataatatgtt agaaggtgat 
40 gggctgccat agtttcgtct tgaacatttg 

agatatttaa agccaggaga cttgttgaga 
aaagagaaac aagaactgag ccctagggca 

45 

aaaataataa tttttttact tgaaaagttt 
tattatatct tattcgtgtg catgttttca 
50 tgtttttgtt tgaatttttt tttttgtttt 

tcttgctctg ttacccaggc tggagtgcag 
cctcccaggt acaagcaatc ctcccaactc 

55 

ccaccaccac gccctgctaa tttttgtatt 
ccaggctggt ctcaaactcc tgacctcaag 
60 tggaattaca gtcattagcc accgtgcccc 
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aaaggttgcc 


gtgagctgag 


atcctgccac 


2 124 0 


tccatctcaa 


gaaaaaagaa 


cagtaactgg 


213 0 0 


cgctgagaaa 


aaacaggtgg 


attctcctct 


21360 


tggtaagaga 


agaaattgaa 


attgtatttc 


21420 


ttctttcagt 


agaatagggt 


cagagtggta 


21480 


gaagctataa 


ttatgggcat 


taagctatta 


21540 


ccactgttaa 


tatcattttt 


cattctaggc 


21600 


tttgtaattt 


gtttgcaagc 


agagtgaaca 


21660 


tccacggaaa 


accttacatc 


cacagagata 


21720 


cccccatgtg 


atttggtaaa 


ctcccagaac 


n w—w a >~i 

21780 


gtgcatactg 


tcccaaattt 


tctccacaac 


21840 


ccccttgact 


gattttcaca 


aaaactcaca 


21900 


ggtgggggaa 


M J ■ J * 

gattatgatt 


gtaatttcca 


21960 


actctctccc 


taatccccat 


cactccgtct 


^■y f~\ y-\ 

22 02 0 


acctcaatca 


atcaatctat 


acatatacat 


22080 


gtagatatat 


atatatataa 


tttgctgcat 


2214 0 


tcagtgaaca 


aacagatgaa 


aatttattac 


2 2 2 0 0 


taaaagcaat 


aacaataaac 


aggacaaaca 


2 2 2 6 0 


aggtgtgatg 


aaaaaaagaa 


aacgtaataa 


22320 


gacggattca 


gttgagaatc 


ttagacatga 


22380 


tctccagtgg 


aaagaacaca 


acaaagacga 


22440 


ctccaacatg 


aaaaatagaa 


aaaaggggag 


22500 


cttggtttaa 


tcatgttcta 


aagagtaaaa 


22560 


agttaaagca 


gaaaataaat 


gtagaatata 


22 62 0 


ttgcttgttt 


gtttgttttt 


tgagacagaa 


22680 


tggtgcagtc 


tcagctcact 


gcaacctctg 


22740 


agcctcccaa 


gtagctggga 


ctacaggctc 


22800 


tttagtggag 


acggtgtttc 


atcatattgg 


22860 


tgatctaccc 


acttcagcct 


cccaaaatgc 


22920 


accagaatat 


atgtcttttg 


taccataatt 


22980 
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tttaaattgt 
gagttcagga 
ctttagctgt 
agtttttagt 
catcacctca 
tctccccaac 
gacatttcat 
tcaatatatt 
tatgactaaa 
ttgcatttac 
tgaaagatct 
gacacaaaca 
atggccatac 
atcattttta 
ccaaatagca 
acttcaaagt 
aaacaggcac 
aaaccaactt 
cagtaatggt 
aagcaattgc 
gcacagcaaa 
tttgcaaact 
ttcaacaagc 
ttcttaaaaa 
tgtcaggaaa 
attactaaaa 
actgctggtg 
gaactaaaaa 
acacngngnc 
caaagacacg 
atatctctat 



tttttcaaac 
acttttttct 
ctatgtaagt 
atattcactg 
aaaagcaact 
cttaggcagc 
taaaaatgaa 
gtggtaaaag 
catttctata 
aatagccaca 
ctacaaggaa 
aatggaaaaa 
tgcgcaaagc 
gcacaattag 
aaagcaatct 
atactacaag 
atagatcagt 
atattcaaca 
gctgggataa 
aacaaaaaca 
aaaaagaaac 
atacgtccta 
aaaaaacaaa 
gaagacatac 
atgtaaacca 
agtcaaaaaa 
gaaaagtaaa 
taaaactacc 
ncncncncnc 
cgctcactcc 
atatacatat 



aatatacatt 
gaaaacatat 
attatcttaa 
atatgtgcaa 
ctgtacccct 
cactaatcta 
tcacatcata 
ttcactcatg 
caccataaca 
aaagtaataa 
aactagttaa 
cattttatac 
aagctacaga 
aaaaagtatt 
taatcaaaag 
ggtatagtaa 
ggaaccaaat 
aagtcaacaa 
gtggccatcc 
aaaattgaca 
tattaaacta 
caaaggacaa 
taaccaaatt 
aagcagccaa 
aaaccaaatg 
tagatattgg 
ttagctcagt 
acttgaccca 
ncncncncng 
cggtgcccct 
atatatatat 



aagtgacttc 
atatttattt 
ttgattcttt 
ttattaacac 
tggctatcac 
catagggtct 
tgtaatttct 
ttgtagcata 
ttcaagctga 
aatacctaga 
tacaaaatat 
tcatggatta 
ttcaatgcta 
tctaaaattc 
gaataaagcc 
ctaaccagta 
agagaatttg 
aaataagcaa 
atatacagaa 
aatgggacct 
aacagacaac 
atactcagaa 
aaaaagtggg 
caaacatgaa 
agataacatc 
tgagaatgcg 
ccctatggaa 
gcaatcccag 
ngngngnggg 
ctctggcgat 
atatatatat 



cttttggggt 
tcatgtaaac 
tcttaagcat 
agtccattca 
tcccatcccc 
ccatatattt 
tgttaactgg 
tatcagtact 
gagtcaaatc 
aatacatcta 
tgcttaaaga 
gaagaatcaa 
tttctatcaa 
atatggaatc 
agaagcatcc 
ccatgctggt 
gaaatacatc 
ttgggtaagg 
taataaaacc 
aattaaacta 
ctgcagaatg 
tctataagta 
caaatgacat 
aaatgcttaa 
tcataccagt 
gacaaaagga 
agcatctgga 
tgcgcgtgtg 
tgcgtcgtcc 
atgggtatag 
atatatatat 
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gcagatattt 23040 

ctgggaaaat 23100 

acaactcaat 23160 

aaaacatttt 23220 

aacttcccat 23280 

tcctactcca 23340 

cttatttcac 23400 

ttattctttt 23460 

aagaatgcaa 23520 

accaaggagg 2 3 580 

aatcatatat 23640 

aatcattaaa 23700 

accaccaatt 23760 

aaaaaagaac 23820 

acattacctg 23880 

actggtgcaa 23 940 

tgcacaccca 24000 

attccccatt 24060 

gaaccctcaa 24120 

aagactttct 24180 

ggagaaaata 24240 

acttaaacaa 24300 

gaacagtcac 24360 

tatcactaat 24420 

cagaatggct 24480 

acacttatac 24540 

aatttctaaa 24600 

cgccccnggg 24660 

cactctntnn 24720 

aaaaagagat 24780 

atatatatat 24840 



-289- 



WO 02/062946 



PCT/US01/48437 



atatgcatac 
caatatggat 
aatgccagtg 
aagaaaatag 
actacctctt 
aaaatacacc 
ttttttttaa 
ctgcacaccc 
gagtccctat 
tggaccctct 
taatacaagg 
cattgagtct 
ggttgagaaa 
gaattttcac 
tttactgata 
ataaatcaca 
gcacctcaca 
caaagctgtt 
tctcacaata 
tgacctttat 
cattgttatt 
taaatatgtg 
ttgggcttct 
gtcatagtga 
atcggtgacc 
taagataaca 
cccctgtctc 
ttaggccaat 
atctctctct 
gtgagatagg 



catggaatac 
gcagctggag 
gggttataat 
acattgatga 
gggtactaac 
caaataacaa 
agaacgtgaa 
acaaccaact 
tcaataatgg 
gaatgccttt 
ttgaatagat 
ttcaccaata 
gatcccatct 
caaatgcttt 
tggcatatga 
cttgcttaca 
gataatgagg 
tcttcaacat 
tttcaaactt 
gattattata 
gttttggcat 
cgttctgact 
ctattttcta 
tctgtgatca 
tttgatatta 
aatttaattg 
tctccatctc 
taataaccat 
taagtcaaaa 
ccaaagttaa 



tacacagcca 
gccattgtcc 
aagtgggagc 
atgctactgg 
tgggtgatgg 
aactgtacat 
tagatattgt 
ggtattcaac 
tgctgggatc 
tgggttttct 
ataatgagag 
ggtgtgttag 
attttgagta 
gtctacatca 
cattaattgt 
gtgtacaagc 
ttttttacaa 
cacgtgctca 
ttttatgatt 
tttgtcatag 
accataaact 
gctctaccaa 
atatacaaca 
gtgaccttta 
acattgttat 
ataaatatgt 
cttgggcttc 
acaatggcct 
gttataaatg 
tcctcttgca 



taaaaaagag 
taggtgaatt 
aaaacgttgg 
gggaaggcag 
gatcaatcat 
atgccccctg 
tttatttcta 
aaagtcaaca 
attggccatc 
ttactgccga 
tatatctcac 
gtgtggattt 
ttttgagtgt 
attgagatga 
ataagataaa 
atacctcgtt 
agtgaaggtt 
cttcatgtct 
attatatttg 
tgatctgtga 
gtgtctatat 
ccacacttac 
acattgaaat 
tgattattat 
tgttttggca 
gtgttctgac 
tctattttct 
ctaaatttta 
attaagctta 
cccaaccatt 
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tgaaatcatg 
aatgcagaaa 
gtacatatgc 
gagggaagaa 
accccaaacc 
aatctaaaat 
tagtttattc 
aaaataagca 
catacacaga 
actgccctgg 
cgtcttattc 
ttattagatg 
ttttatcata 
tcatatgtgt 
accagccttg 
ttattgtgct 
tataataaca 
ccgtgtcacc 
tcatagtgat 
ttggtgacct 
aagacaacaa 
ccttgtctct 
taggccaatt 
atttgtcata 
taccataaac 
tgctctacca 
gagatacaac 
agtgaaaaga 
tgaggaaggc 
aacataggtg 



tcctttgcag 
cagaaaacca 
acataaagat 
agttttaaaa 
tcagcattat 
gaaagttgaa 
agaaataaat 
atggggtaag 
ataataaaac 
ttagaatttc 
ctcatgaaag 
ctccttatca 
aaacagtgaa 
gtttttctat 
catttgtgga 
ttgctatatt 
ctgcatcaag 
ttttagtaat 
ctgtgatcag 
tcgatattaa 
atttaattga 
ctccatctcc 
aattatattt 
gtgatctgtg 
tgtgtctata 
accatcctta 
aacattgaaa 
agagccacac 
atatagaaag 
tgaatgtaaa 



24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
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ggaaaaattc ttgaaggaaa ttaaaagtgc 
acaaaacaag cttgttgctg atgtggagaa 
5 ctacagcatt tccttaagcc aaagcctaat 

attctgtgaa gaccgagaga gatgaagaag 
ttgattcatg aggtttaagg aaagaagttg 

10 

agcaagtgat aatgtaggag ctgcagcaag 
tgaaggtggc tgtattaaac aacagatttt 
15 gaagacgtct tataggactt tcccagctag 

acaggctgac tctcttgtta gaggctaatg 
tcatttacta tactgaaaat tctagggccc 

20 

gtaaatggca taagaaagca tggataacag 
attttaaatt caatattaag aactactact 
25 ctgctcattg acagtgcacc tggtcacatg 

tttgcagtga tgaatcaagg agtaatttgt 
ttttgtaaga ctacagctgt catagctagt 

30 

ttgaagagct tctggaagat tcaccattct 
ggaggaggtc aagatagcaa cattagcagg 
35 gaataacttt gaggggttca agacctcggc 

agcaagaagg gaagcctgaa gatgtgactg 
tggataagga gttgcttctt atggatgagt 

40 

ctccagtgaa gatgctatga acattgctga 
aagcttagtt gataaagcac tgtcaaggtt 
45 ttgtatggtg gataaaatgc tatcaaaaaa 

aaagaagagt tgatgggtgc agcaaacttc 
ggaccccaac cttcagtaaa caccacccta 

50 

gaccctccac cagcaaaaac attacagctc 
tagcaataaa atattcttaa attaagtaca 
55 acacttaata gacgacaata aactgtaaac 

agtctctctc tgtcccccag gctggagtgc 
cgcctcctgg gttcatgcca ttctcctgcc 

60 

acctgccacc acgcccggct aatttttttg 
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tgcttgagtt 


aacacacaag 


tgataagaaa 


26700 


agttttagtt 


gtctggacag 


atcaaaccaa 


26760 


ccctagcaag 


gccctaattg 


gctcttttca 


26820 


cttcagaaga 


agagtttgaa 


gctagcaagg 


26880 


tctccataac 


ataaaaatac 


aaggtgaaac 


26940 


ttatccagaa 


gatctagcta 


aaataattga 


27000 


caatgtagat 


aaaacagcct 


tatattggaa 


27060 


agacaatgct 


tgggctcaga 


gcttcaaagg 


27120 


cagctgatga 


ctgtaagttg 


aagctaatgc 


27180 


ttaagaatta 


atctactctg 


cttatggtct 


27240 


cacatctgtt 


tacagcatga 


tttacagaat 


27300 


caggaaaaaa 


agcttccttt 


taaaatatta 


27360 


tgttttcatg 


tctgttaaca 


taacatccaa 


27420 


actttcaagt 


cttgttattt 


aagaaataca 


27480 


gattcctctg 


atagatctcg 


gcaaagtaaa 


27540 


agatgacatt 


aagagcatta 


atgacataca 


27600 


actttggaaa 


aagttgatac 


tacctctcat 


27660 


ggaggaaatc 


actgcagatg 


ggatggaaat 


27720 


aattgctgta 


atctcctgat 


aaaactttaa 


27780 


aaagaaagtg 


ctttcttcag 


gtggattcta 


27840 


aatgacaatg 


aagaattcag 


aatattacat 


27900 


ggagaggatt 


gactccaatt 


ttggaagaag 


27960 


cattgcatac 


aacagaggaa 


acttttggga 


28020 


atgttatcta 


atcttaagaa 


attgccacag 


28080 


atcagtcagc 


agccaacaac 


atcaaggcaa 


28140 


actgaggctt 


aggtgatcat 


tagctttttt 


28200 


tttttagaca 


taaagctatt 


gcacacttgc 


28260 
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agtggcacaa 


tctcggctca 


ctgcaagctc 


28380 


tcagcctccc 


aagtagctgg 


gattacaggc 


28440 


tatttttagt 


agagatgggg 


tttcgccatg 


28500 
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ttagccagga 
gctgggatta 
aactaaaaaa 
tgaacctgca 
gtttgctaac 
gttttctt'tt 
gaacgagtta 
ttctttaaat 
agaagatctg 
ttcttcctta 
ctaatttgtt 
ggtttgtagt 
ttctttagtc 
gttcattgat 
ttactacttt 
atcttaagat 
tttacagcta 
gctgtatctt 
taatcccttg 
ttgtgttatt 
gccttttaaa 
ccatgtgcag 
tctgttaggt 
tgccttgttg 
aaattgtctg 
ttcttagaaa 
tttttttttt 
ctctgctcac 
agtagctggg 
agatggggtt 



tggtcttgat 
caggtgtgag 
ttcatctgtt 
atatctccaa 
attttgttga 
gttgtgatgc 
ggaagtgttc 
atttggtaga 
attataaatt 
gttttagtag 
ggcatataat 
aatgtttcca 
tcattaaagg 
tttatctact 
atttccttct 
ggaaagttag 
taaatttcta 
cattttcatt 
gttatttagg 
aattttaatt 
tttattaaaa 
ttcagaaaac 
ttagttggtt 
ctctactcat 
tttctttctt 
atatatgttt 
ttgagacgga 
tgccagttcc 
accacaggcg 
tcaccatgtt 



ctcctgacct 
ccactgcgcc 
acgttattaa 
agtatgcctg 
agatttttgt 
attagtctga 
cctcctcttc 
agtcaccact 
taaatacctt 
tttgcatgtt 
tgttcatagt 
cattcatttc 
cttttccatt 
gtttttctat 
gctttcttca 
gttattgatt 
tctaagcact 
catcgcaaag 
agtatgttgt 
ttgttccatt 
tgtgttatgc 
cgtgtattct 
tataatgttg 
cattaaaagt 
tgtttctgtc 
ataaccatta 
gtctcactct 
gcctcctggg 
cccaccacca 
agccaggatg 



tgtgatccac 
cagccagcat 
agtattcatt 
catatttctt 
gtccatgttc 
ttttgatctc 
tattcttaga 
gaagtaatct 
aattctttat 
tctaggaact 
attctttcat 
tgattgtact 
ttgttcatct 
tttgcttcta 
ggtttgatat 
tggggtcttt 
aatttagctg 
tattttctaa 
taatttccac 
gtacttagag 
cctaaactct 
tctgttgttg 
ttcaaatctt 
gggtcattga 
tggttctgct 
taaaatgttc 
gttgcccagg 
ttcacaccat 
cgcccagcta 
gtctccatct 



ctgcctcggc 
aacttttata 
ttattgcagt 
tttctgtgtt 
ataagagatc 
agggtaatag 
agagtttgtg 
gggtctgcac 
agtctattca 
tgtccatttc 
aatcacattt 
aatttgagac 
ttcaaagaat 
aataattttc 
gggttttttt 
cttctttgtt 
catcctataa 
ttttcttttt 
atatttgggg 
aaaatacttt 
ggtctattcc 
ggtgaaatgt 
ctatttcctt 
agtccctaac 
ttgcgtgttt 
ctctctatct 
ttggagtgca 
tctcctgcca 
attttttgta 
cctgacctcg 



ctcccaaagt 
tgcattggga 
ggcct'gaatc 
gctgtattga 
atggttttta 
tggcctcaca 
aagaaggtaa 
ttttcttcat 
gattttctat 
agcaggttat 
atttctataa 
ttctctattt 
caacttttgt 
actttaatct 
tattttcatt 
agtataggca 
gcattgatat 
atttcttctt 
tttcccaaat 
actatttcta 
aaagaatgtc 
tctatagatg 
gtttatcttc 
tattattgtt 
caagcctctg 
ctagtagcat 
gtggccccat 
cagcctcccg 
tttttagtag 
tgatctgccc 



28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
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gcctcggcct cccaaagtgc tgggattaca 
ttaaaatata ttttgtctgg tattagtata 
5 ggattacaaa tatttttcta tttttttgtt 

tgtgtctccc atagacagca catagttgga 
tgcctttaat tgaattgttt tattaattca 

10 

tgcatttgac attttacttt ttctccatgt 
gactgcttta ttttttggct aagtggatat 
15 ttttttactg tatatttcag ttatttcatt 

acttatcaaa atctgcttca catttatact 
atccctgtat agctctattt caatacctct 

20 

tgaatgttac aaacataaga taacattatg 
aaagaagcaa agagaagaaa aggaacaagt 
25 tatctattta ccattacaat ttattattta 

ctcctcattt gttcctgtgg gttcaaattt 
ttccttagct taaaatacat ttgctcctct 

30 

attgcatttc cctatgttat agacccaaca 
tgctttttaa ataagttagt agaagaaaga 
35 atttgtataa ttttcttttg tcagtgatct 

tgtagtcact tgcttttggt ctgaagaatt 
caacaaattc tttcagattt tgttcatctg 

40 

aagacagctt tgctagaagt aggattcttg 
ttgaatttgt tatcccacta ccttctgggt 
45 aatcttattg gagttccctt gtatgtctca 

ttctctttgt tggctttcag cgtttttact 
agtttcttcc ttgggatttg ttgagattcc 

50 

atatggaaaa ttttcaggca ttatttcttt 
cccctttctg gtactttcat tatgtatatg 
55 taagtctcta ttcatatttc ttcattcact 

ctatcaatct atcttgaaat tcactaattc 
gccactctaa cgaatttttt attttagctg 

60 

ggtgttttaa aaatgattta tatatcttta 
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ggcatgagcc accatgcccg gccaacattt 30360 

gccattctag ctttcttgtg tttgttcttt 30420 

ttcattcttt ttgtgtcttt gaatctaaag 30480 

tctggttttt gtatccaatc tgacaatctc 30540 

catttaatct tatattgaaa tagttaggct 30600 

tttatgtttt ttgtttctct attctgcctt 30660 

tttctaatgt agtattttaa tttttaataa 30720 

aattgtcgct ctaaggctca tcatatttta 30780 

aatttaattc cagtgaaata tagaaacatt 30840 

ttatgtgata ttattgatgt attacatcta 30900 

attattactt tatataattc tatctctttt 30960 

atacatttac agtttttgtt atttaccatt 31020 

ccagaaattt tttatttata atttcttgtt 31080 

aacatctaga gttaaactcc tagatgttga 31140 

ccccccactt ttgctgttat tggcaaatat 31200 

atacgttaca tacatattgt ttcatatgat 31260 

ggataaattt gcatttacac tttcttttat 31320 

ttatattttc atgagattca aattactgtc 31380 

ttctttagta ttttttgtaa ggcatgctag 31440 

caaatgtctt tattttgttt tcatttttga 31500 

gttgataggt cttttttttc tttgaaccct 31560 

ttcaatgttt ctgattagaa gtcagctgtt 31620 

agtcattttt ctcttgctgc tttcgagatt 31680 

gtgatatgtc tacttgtgga tttctttgtg 31740 

taggcggtat agattaatgg ctttgaataa 31800 

gaatatattt tctgcttctt tctctttctc 31860 

ttgtttcctt aatggtattt cacatttctc 31920 

tttctctctg tctttcagat tgcataatct 31980 

tttcttctgc cacttcaaat ctactgttgt 32040 

ttataatttt caactccaga atttccattt 32100 

ctgatattct ctatttgatg caatattgtt 32160 
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atcacacctt cctttacttc cttaatcgtg 
tgactatttt gaaacatttg tttattaaat 

5 

cctacagttt atatcatcta tgggtcaaac 
tgcctattat ataattgttt tgctggaaac 
10 tctgggtact catcccccac accaggactt 

tctaggttgt ttcagtgaag tctatttctc 
cttatagggt acaggcttgt gatgcccaca 

15 

ggctctttgt ctctttccct aaccataccc 
tgttctgagg caaaaactgc ttcacagact 
20 gatagctccc aaaatcagtg tttgagattt 

tccccttttc ccatttctct ctgggaaact 
atgtatctgt gaacacctac caagtgcctt 

25 

cttgtaagcc taaacttctc cacatgctac 
ggagatttct attccatagc ctatttctct 
30 aggggtgagg aagtgggtag ggacaacgga 

gaatggggga ggggacagaa gccccagttc 
cttaccccaa agccagaaaa aggtcaatta 

35 

gagtaaaagc tgggtagaag agatcaccca 
ctcagtgtta gtatttgggc aggaccagaa 
40 aagatagtat agcactctga ctgggatctt 

ttccaatcta gagtggtctt gctgatctga 
cagataccat accagaaact cttacagcat 

45 

cctttatttg ctatattcca ttaaaaaggt 
gatttttgaa ataattttta caaatttggc 
50 atgctggaag tgggactctc caaaaataat 

tattagaatt tttaggaaca tagatacctt 
atcctcatta atggacatag gataagtgtg 

55 

ctttgtacta aggatgaatc tttttagttg 
acaaagctaa aattgaaaat cctaatcaca 
60 catgtattaa atgtattatt tatacatgac 



gcttacttaa 


gtgctcaaac 


atctttataa 


32220 


attatatctg 


gtctctcaca 


gtttttgttg 


32280 


ttgcctgttt 


cattgtctgt 


tttatattgt 


32340 


tgggcatttt 


cgataatata 


ttatagcaac 


32400 


attgttattt 


acttgattag 


aaaaaaactg 


32460 


cgtcctcact 


gttaagcttt 


gatgttgccc 


32520 


gtcaccctgg 


gatgacattt 


atcttggaaa 


32580 


agctgttaag 


ctctgctagt 


gcttgctggt 


32640 


aacccaaaca 


aatttgagct 


cctttgaggg 


32700 


gttctgactc 


caaaagggct 


cttcctagct 


32760 


cgtgggccta 


cagtttagcc 


tgtatctaaa 


32820 


ttaccacaac 


ttccactgtt 


tttgagagta 


32880 


aaatgaagtc 


acctcctttg 


ggaacagagt 


32940 


cccaacatct 


ctgagctaag 


gatctggtgc 


33000 


gtctttctct 


ttaagagttg 


agggtttggt 


33060 


ctctcaactt 


gcctctcatg 


atgtgaaacc 


33120 


gagcccagta 


ttatagagtc 


ttcatcccat 


33180 


tctcttggct 


gccttcactt 


ggtacttacc 


33240 


aggctaatta 


gatggcctac 


ccttcctgag 


33300 


tgagaaaagg 


gagccctgtg 


tttttggcta 


33360 


gaggagggag 


ggagggtgca 


gatcttggtt 


33420 


ttactgagta 


gattttcttg 


aataaatgtt 


33480 


ttacagagta 


cagaggtttt 


aaaggattaa 


33540 


tggggagctg 


gttgagttcc 


tcccattctt 


33600 


aatgctgtac 


acttaattct 


ataaataatt 


33660 


cttaagtgat 


cctggttcat 


aagcctctct 


33720 


cagaggcatt 


ctaagcatac 


cacaatctga 


33780 


acaactccag 


cagcatagtt 


ctacatattc 


33840 


atatactaga 


ttattcctcc 


tttagtatgc 


33900 


taatacatat 


atgtattaat 


acatgacatg 


33960 
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tattaaatgt 


attaaatgta 


ttaaatttta 


ttaaatgaaa 


ttgtgcttgt 


aattatatct 


34020 


cccccattag 


attaaaagct 


ccttgagtag 


aaacctgagg 


tttagtacat 


tgtaggcatt 


34080 


gtataattaa 


tgaataaata 


aatattccct 


ctgaaaaaaa 


aattatccta 


ctagcttcta 


34140 


gacaaatttg 


aggtggactc 


ctgatatcaa 


cctctgtgac 


aatcaaagaa 


agtcagattt 


34200 


ttaaaacttt 


aacttgaatg 


ttctagccag 


taggacagac 


tgattgette 


agcaacatct 


34260 


gtttatgctg 


tccataacca 


atttttcttt 


gtatgtctat 


tacaccttac 


tttagcaatt 


34320 


gtaatgaaca 


gcataactat 


ttgtgtgact 


acacttggta 


taattctatt 


aagcatggag 


34380 


attgcaactt 


ttgcattaaa 


tgcaaccaca 


tacaattggt 


caaaegtaag 


taccaatttt 


34440 


aataataatt 


agatcaaaat 


tttaatataa 


atgaaaaatt 


atggtgaatg 


atgtatgttg 


34500 


aagtgagtaa 


ggcatagaaa 


aggccaccaa 


atgtgtcaat 


tatctttggc 


atagaaatgt 


34560 


aaagataaac 


tgtattttct 


ctatattctt 


gctataagca 


cattgaaaaa 


taagtttttc 


34620 


tcaaaaatcg 


tgatgcacta 


taaacctcaa 


agagagatta 


aaattatcgt 


taagataaaa 


34680 


tgatagaata 


cttagaaacc 


ccaaagacct 


acctgaaaaa 


caactgeaga 


agtcaaacag 


34740 


tttagagagg 


tggctggaca 


aaagaaaaac 


ctgcaaaaat 


taatagattg 


tctttacact 


34800 


agaaataacc 


acttaaatat 


tgaaattttt 


aaaaaaatca 


aaataacagc 


aaaactataa 


34860 


atgtttaata 


agaagcattt 


taaaaggact 


gaatactttg 


aaaaattatg 


ttcttgaatg 


34920 


cgaatttaca 


aaatcataaa 


attgcaagtt 


tetgeaaatt 


aacacataat 


tgtgatgcaa 


34980 


ttccatttag 


aatctcaaca 


gcatttcgga 


agtcatgcaa 


taaaattact 


etaaaatgea 


35040 


tatagataga 


tgaataattg 


cataaaaatg 


ccaagaaagt 


ctagaagtta 


catagtgatt 


35100 


gaggaggatc 


agataatttg 


ctgcactaga 


tattaaaact 


tgttattagg 


ctattataat 


35160 


aaaaatagta 


cagtatttac 


acactaaaag 


gaaaatttaa 


ctaagcacag 


aatagagagt 


35220 


tgaaaaataa 


atctaagtat 


aagtgaacac 


atattatttt 


gtaggtgata 


acatgaccag 


35280 


tcaatagaga 


aaagatgaaa 


ttggaaaaaa 


acagtattaa 


atcaataagc 


tattgatttc 


35340 


caagaaaata 


ataatggaca 


cctactttgt 


accacaaata 


aaaataaatc 


acagatgaag 


35400 


tatgtgtaca 


taaaaaccat 


taaaattagc 


ataaaatata 


gaactaatat 


attatttaca 


35460 


taagcaacag 


agaaaaccaa 


gaaggaggaa 


agatagaaag 


acattttatg 


tcacaaatat 


35520 


ttgcactttt 


tatacaatac 


aggctgagac 


aacaatagee 


taaagagtta 


aaatactgee 


35580 
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aaaaaataaa 


actatcccct 


3564 0 

—J ^ V 


agctgactca 


gaagacagaa 


ccacagagga 


ttatcctcag 


gectcaaaaa 


tctaacaaaa 


35700 


ttttctctat 


tgggttttgg 


acttgettag 


gtctgctgcc 


cacttctttc 


tttcttattt 


35760 


tctctattgg 


aaatcgaaat 


gactatcctc 


tgccagtccc 


accattgtat 


tttagaagta 


35820 
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gattactagt 


tttctggctt 


cacaagtcca 


cagatagtga 


ggaatttttg 


ccccaagatg 


35880 


5 


aatcatgcta 


taagtgatga 


tttctagcat 


gaatctagat 


gctataactg 


atttagatga 


35940 


tgagatttga 


ggatttttga 


ttttataata 


tttaggaaaa 


gtttggacat 


agagttgatt 


36000 




ctagagtggg 


ttacaacttt 


cggagatgtt 


gaaattagct 


gaatatattt 


tgcacatggg 


36060 


10 


aaggacatga 


attgagggga 


actagaggca 


gactgttatg 


ggttgaattg 


tgataccgaa 


36120 




acatatatgt 


tgcagtccta 


acccatggca 


cctcagaatg 


tgacattttg 


ggggaatggg 


36180 


15 


attattccag 


gtataataag 


ctaaattaag 


atgaggtcat 


acaggggtag 


gtaggtcatt 


36240 


aatccagtat 


gattggtatt 


cttataagag 


agaaatttgg 


acataggtac 


ataaggaaga 


36300 




taacactata 


tat taat tea 


tttttacact 


crc tat aaacra 


ac tciac tcracr 


acagagtaat 


36360 


20 


ttataaagga 


aagaggttta 


attgactcac 


agttttgcat 


ggctegggag 


gtctcacaaa 


36420 




acttatagtc 


atgacagaag 


gcaaaggaga 


agcaagtacc 


ttcttcacaa 


ggaggcagga 


36480 


25 


gagagagcaa 


aaggggaaca 


gctacagact 


tttaaaccac 


catatctcat 


gagaactcac 


36540 




tcactaccat 


gaaacagcat 


gggggageca 


ctcccatgat 


ccaattacct 


cccaccaggt 


36600 




ccctccctca 




at tacaat t c 


craoatQaQat 


ttQQQtQQQQ 


acacaqaqcc 


36660 


30 


aaaccatatt 


attccatccc 


tggcccctcc 


caaacttcat 


ctcctgctca 


catttcaaaa 


36720 




cacaatcatg 


ccctcccagc 


agtcctccaa 


agtcttaact 


cattccagca 


ttaattcaaa 


36780 


35 


agtctaagtt 


caaagtctca 


cctgagacaa 


ggaaatttcc 


ttccactgag 


cctgtaaaat 


36840 




caaaaacaag 


ttagttactt 


ccaaggtaca 


atgggggtac 


aagcattagg 


taaatgttcc 

* 


36900 




cat t ccaaat 


crcjQacaaat t 


QQCcaaaaca 


aaqqqqccac 


aqqtcccatq 


caagtctgaa 


36960 


40 


acccagtagg 


gctgttatta 


aattctaaag 


cttcaaaatg 


atctcctttg 


gctccatgtc 


37020 




tcacatctgg 


ggtacacaga 


tgcaagaggt 


gggctcccaa 


ggccttgggc 


agctccaccc 


37080 


45 


ctgtggctct 


gcagggtaaa 


gcctctgcag 


ctgttttcat 


aggctgatgc 


tgagtgcctg 


37140 




cagcttttcc 


aggtgcgttg 


tagaggctgt 


cagtggattt 


accattcttg 


ggtctggaga 


37200 




atQQtQQCCC 


tcttctcacq 


qqtctaqtqq 


qcaqtqcccc 


aqtqqqqact 


Ctqtqqqqqc 

ZJ ZJ ZJ ZJ ZJ ZJ 


37260 


50 


ctcaacccca 


catttcccct 


ctgcagtgcc 


ctagtagaag 


ttctccatga 


gggctccaac 


37320 




cctgtcacag 


ccttcttcct 


ggacatccag 


gcattttcat 


acatcctctg 


aaatctaggt 


37380 


55 


ggagcccacc 


aagcctcaac 


tcctgacctc 


ggtgcaccca 


caggcccaac 


accatgtgaa 


37440 




agccaccaag 


gcttggggct 


tgctccctct 


gaagcaacag 


cccaagctgt 


accttggccc 


37500 




cttttagcca 


cagctggagc 


tggtgtggct 


gggaagcagg 


gccccaagtc 


ccgagactgc 


37560 


60 


acagagcagc 


ageaggggee 


ctggacctgg 


gccacaaaac 


cattttttcc 


tcctaggcct 


37620 
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ccaggcctgt gatgggaggg gctacagtga 
cccattgtct tggctattaa tatttggttc 
5 cttgaattta tccccagaaa atgagttttt 

tccaaatttc tatgctttgc ttccctttta 
tgctcacaca tatgtgaaca cacctctaga 

10 

tgcttagaaa tttcttccac cagataccct 
aaatctctag ggcagggcaa aatgccacca 
15 ctttgctcca gttcccaaaa agttcctcat 

cattgtccat atcactatca gcatttcagt 
cctaactttc ccacatcttc ctgtcttctt 

20 

ctatttccca gttccaaagt tgcttccaca 
ctcctggtac caatttcctg tgttagtcca 
25 actgggtaat ttataaagta aagagggtta 

gcctcaggaa atgtacaatc atggcggaag 
ggaagcagga gagagagagt aaagggggaa 

30 

atgagaactc actcactatc atgagaacag 
acctcccacc agatcccccc tcaacacatg 
35 gggacacaga gatgaaccat atcaccatct 

attacaagtc aaaaacacct ggggctacca 
cctatatgcc ttagaaagag catgatcctg 

40 

cagacccata aaacaataaa cttctgttgt 
atattgccct aggaaatgaa tacatgagga 
45 atacttagtt gttatgaaaa tacccaaaaa 

aatatttact cctgattttc taaatgttga 
tgaagatttt tttattcaga tggttggtat 

50 

catcttagag atgaccattg taatcataat 
tgaaaaacac actgaagaaa gtgagtaact 
55 cagtttataa taagggccct ctctgagagt 

tgcttcagaa gtaaggtcaa gtgtatctga 
tgtgtggaat tgaagtcaaa caacttgccc 

60 

tacatctccc aggagattaa cagaggaaat 
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agatgtttga 


catgccctgg 


agacattttc 


37680 


cttgttactt 


atgcaaattt 


ctgcagctgg 


37740 


cttttatacc 


atgtatcagg 


ctgcaaattt 


37800 


aacacaactt 


ccagtttcag 


ataatctctt 


37860 


aaaaaacagg 


tcacctctta 


aacactttgc 


37920 


aaatcatctc 


tctcaagttc 


aaagttccac 


37980 


gtctctctgc 


taaagcatag 


caagactgac 


38040 


ctccatctga 


aaccacctca 


gcctggactt 


38100 


caaaagcatt 


caacgagtct 


ctaggaagtt 


38160 


ttaagccctt 


caaactcttc 


caacttctgc 


38220 


ttttcaggct 


atctttatac 


cagtacccca 


38280 


ttttcacact 


gctataaaga 


gctgcctgag 


38340 


attggctcac 


agtttcacat 


ggctggggaa 


38400 


gcaaaggaga 


aacaagtacc 


ttcttcaaaa 


38460 


cttccacaga 


cttttcaacc 


atcaagtctc 


38520 


catgggtgaa 


actgccccca 


tgatctaatc 


38580 


gggattacag 


tttgaaatga 


gatttaagtg 


38640 


gaacatagag 


gcagagatag 


gagttatact 


38700 


gaacctggat 


gagacaaggg 


aggattctct 


38760 


ccaatacatt 


tattttgaac 


ttttggcatc 


38820 


gttaagccac 


ccagtttgtg 


gtactttatt 


38880 


caatatgtaa 


gtactcaata 


aatttaaaac 


38940 


taatcataat 


tgtgtgagac 


tgctaggatg 


39000 


tcttatataa 


ttttaaatgg 


ttaattaata 


39060 


gatgctttta 


caatatttac 


tactctgcac 


39120 


catccagtgg 


gttgtaagag 


catttcatca 


39180 


ttgtttctgg 


cagtcaaatc 


aagataaagg 


39240 


tcaaaqttta 


qacttqgtga 


acaqqaqtqt 


39300 


aaacatgctg 


aaatatcttc 


tgcttgatat 


39360 


tggcaaaact 


atgtagaagt 


cttttagact 


39420 


atatagacta 


aaagggtttc 


agagatcata 


39480 
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cttgtgatat ggggatactg actgctataa 
ttggtttcca acatttggaa tatcaaatgc 

5 

gaaaagagta cttctagaat agaacagtaa 
gcaatgagaa tactgaaaaa tttgtcaaaa 
10 aaagacctat gaaaatccaa taagcattta 

acagtgagct ctatggtatt gcagcgtgcc 
agtagccttt aaaagtaaca gccttgcaac 

15 

aacaacagga aaggataaaa taggatttaa 
ttgttgttaa cttgtctggc agctccccag 
20 atttcattca gaactcagtc actgtaaaca 

atcaatagca tttaacaaca cagctgcctg 
ggtcacattt taaaaaaaaa tctgagcagt 

25 

acaatatctg gaaacataaa aggtcacatg 
agacctgaaa aatgtcctaa cctcccacct 
30 cattcaagtt aaatggccga atcatggtaa 

tcctattccc atccctcaca acccatctcc 
ctaaggcaga gttgcaaact ctcccagagc 

35 

caaacacgca cacacacaat ctctcagcaa 
taaggaaatc tttgtccaat cactagctga 
40 acaatgataa aaccaacccc tagagggcta 

ttaaatgttc aatttttaac aagtaaaaat 
gcttaaatac aggagaaaaa aacagtcaat 

45 

gaggttacta gaaaaaaaag gtattttaac 
ttaaaaacca tgtctaaaaa aactaaaagt 
50 attaatagtg acataaaata ggaactattt 

cagaaaatta taataactaa aatgaaaatt 
ctggcagaag aaataatcca caaacttgac 

55 

aaacagaatg aagaaaaaag aacagagctt 
caaaatgtgc ataatggtga acacataagg 
60 acagtgagtt tgtgaagaaa tattggaaaa 
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acccaaaaat 


tcaaatttag 


ttctgaaatg 


39540 


cttcagaaaa 


taaacatttt 


agaggacact 


39600 


gaacctccaa 


aaacccatct 


ctcaataaaa 


39660 


tcaacttttt 


cagaactttg 


gaaattagtc 


39720 


ttcaagttaa 


atggccaaat 


catggtaaga 


39780 


ctattcccat 


ccgtcttacc 


ctatctcctc 


39840 


catagtacct 


gtgggaaaaa 


acggcctaac 


39900 


gtgccccaaa 


aaagccccat 


tctcagagaa 


39960 


acatccccac 


tggcaagact 


ctttaatttg 


40020 


gcttttcttt 


ctgggatatt 


tgtcagaaca 


40080 


aggcagagat 


aacagttgtg 


ccaaacaaaa 


40140 


aagatgtcca 


tagagtattt 


tgaaattatg 


40200 


catatgctgg 


gatgtgtaca 


tgccaagaaa 


40260 


ctggctgacc 


tgaagaaaat 


ccaattcaac 


40320 


gaacagtgag 


ctttgtggta 


ttgtaacttg 


40380 


tgcttttatg 


ataaaagcag 


gaagtgatag 


40440 


tttaaaacct 


taccccaaca 


caaacacatg 


40500 


agtctaggaa 


acttactggt 


tcaaaagaat 


40560 


ccactaagtt 


aacaaagcag 


aataacaaag 


40620 


tgatttacaa 


tgttgccaca 


ctatattatt 


40680 


ataaaatatg 


caaggaaaca 


gagaagtatg 


40740 


aaaaattttc 


cccaaagaag 


tctaaatgtt 


40800 


tcagcaagta 


taaataagta 


taaagaactt 


40860 


atgaacgcaa 


tgtctcacca 


aaaagagaat 


40920 


ttccaaaaga 


accaaataga 


aattgtagag 


40980 


ttactcaagg 


gtctcaacag 


aaaatttgag 


41040 


agtaggtcaa 


ttggaattac 


ctaatctatg 


41100 


cagacacctt 


tgaaacatca 


gcaagcatac 


41160 


aaagaagaga 


gaaagaaata 


aaaaaatggc 


41220 


tgtatagctg 


taaagactat 


attttgaata 


41280 
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agtatctgaa 


atcaaaaacc 


caactttccc 


ccttaaaaaa 


tcaggaaaag 


aaaactaaac 


41340 


ctaaagtgaa 


aataataaag 


actagagtag 


aactaaatga 


aatagaagat 


aggaaaacaa 


41400 


tagagaaaaa 


caataaaacc 


agaagttcct 


tctttaaaaa 


gaatcccaaa 


ataacaaacc 


41460 


attaaccaca 


ctgaccaaga 


aaaaaagaaa 


gaagacttaa 


tttactataa 


ttataaagaa 


41520 


agaacattac 


tgacaatgtt 


atagaagtaa 


aaatattgaa 


caatcatatg 


gcaacaaatt 


41580 


agataaccta 


cataaaatgg 


acaaatttct 


agaaatacag 


acatgactga 


aactgactca 


41640 


agaagaaata 


gaaatctaag 


tacacctatc 


acaagttaaa 


gaatcaaatt 


aataatcaaa 


41700 


aactttccat 


aaagaatagc 


ccagtctcag 


atagctacat 


tgtgaactct 


accaaaggtt 


41760 


taaagcatta 


atactagtta 


ttcacaaact 


ttttcccaaa 


aaaaagagga 


ggaataaata 


41820 


cttcctaact 


cattctataa 


ggctaacact 


agtttggtac 


taaaataaaa 


caaagacatt 


41880 


ataataaaag 


gaaactatag 


actaatatcc 


attatgaata 


tagatgcata 


aatcctcaaa 


41940 


aactattaac 


aaatggaatc 


tagcaacata 


caaaaagtat 


tatatatcat 


gaccagttag 


42000 


gatttatcct 


aagagtacag 


agctggttca 


atatatgaaa 


actaatcaat 


gtattccacc 


42060 


ttactaatag 


aataaaagcc 


agaaaccaca 


tgatcacctc 


aaaagacata 


gaagaagcat 


42120 


ttaagaaaat 


tgaacaccct 


tttatgatta 


aaaaagttga 


aaaactagga 


atagaatgga 


42180 


attttctcaa 


cctaaatatg 


ggcgtttatt 


aaaaatttat 


agttaacata 


atacttgata 


42240 


aagagacact 


gaatcatttt 


cttcttaaga 


taagaaacaa 


gatatcttgt 


cacttacatt 


42300 


caacattgta 


ttgtaggttc 


tatccaggaa 


aatcaggcaa 


taaaaaataa 


agaatttagt 


42360 


ttaatcaagg 


aaaaattaaa 


actaacactc 


tttacaaatg 


acattatctg 


atctcgtata 


42420 


aaatgctaaa 


aaaatccata 


aaaatctagt 


cgaactaata 


aataagttta 


acaaaatggc 


42480 


agaatacatg 


gccaatatac 


agaaaccaac 


tgtatttttg 


tatacaaaaa 


cacagttaca 


42540 


gtaacatcaa 


aaggaataaa 


tatacatata 


cttcacattc 


aggaataaat 


ttttaaaaag 


42600 


aagtgcgaga 


cttgtctacc 


acaaactcta 


aaacattatt 


gaaagaaatt 


taagaagata 


42660 


caaacaaatg 


aaaagacatc 


ccatgttcat 


tgataaaaga 


caatatttga 


agacaatatt 


42720 


atgatggtaa 


tactccccaa 


ccaatttaac 


acaacccctg 


ttaaattaca 


gctggcattt 


42780 


ttcagaaatt 


gacaagctta 


tcctaacatt 


catatgaaca 


tggaaaagac 


ccaaaatagt 


42840 


caaaccaata 


ttaaaaaaga 


aaaacaaagt 


aggaggtgtt 


aggttttgaa 


gggaaggtaa 


42900 


gggt uaaaga 


aagacacaca 


cacacacaca 


cacacacaca 


cacacacaca 


cacacacaca 


A O Q & C\ 
*t <C O U 


taaaaagggc 


acctcagcag 


caaatgcagg 


ctttacatcc 


agcataaaac 


ctacagaagt 


43020 


ggggaaccag 


tgtaatgcca 


gagcccaccg 


ctgcttatag 


cctgggggta 


cttataggta 


43080 


tgggtgggag 


gggtctaggc 


agtacggctt 


gctacccagc 


aggatattga 


taagatgttc 


43140 
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ccatgatgag gtggttctgg cccttggttt 
ctttgcccag caagatataa tagggatgtt 

5 

tcatgtggtt aggcagaatg tttctcacgg 
ccaaggtctg caaaatagca gagaatttac 
10 ctcgcacttc ctagtttcaa aagttactaa 

agcaggatga gccgtggaca aaactccaca 
atcagctgga agcatcggca gactagcgtc 

15 

tctgtccctt tttttttttt agcgctcaca 
attgattgag cagtctaggg tggtaccgga 
20 agagtgaaac agaacagaat ggaaaatttg 

ctatggataa catcagttgc taagtcatgg 
gcaggcccag gcctggtttt gggtctggtt 

25 

gcctttggtt tcgcttcctt gtttcttctt 
aacaatatgg gggggtctct ttctttcttc 
30 aaaaatcagt agaataggat caaaaaccca 

ttgactagca tgacaagaca attcaatgga 
aggacaactg gatatctgca tacaaaagaa 

35 

cacaaattaa ctaaaaatgc atagacctag 
gaagaaaata caggcataat tatttctggc 
40 caccaaaagc atatttaaca aaagaaaaat 

acagaccaac atgtctgatg actacggata 
caaatctagc aacatataaa tatgattata 

45 

attcaaagtt agtttaacat accaaaatca 
aagggcaaaa acatgctcac taaacaaagt 
50 gattttatct ccttacttat ctatcttccc 

aaaaacaaaa atctgtagtt tggtgtcaac 
cttgtttgtg gaattcctgt aggataatat 

55 

agaagtacct tgagaccaga tcttaatggg 
atttagtcta tagataccag ggtgctcttg 
60 aaacttttca tttatataat tattttcatt 
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ggcagaatgt 


cattatggtg 


ttccttggac 


43200 


tctttagttg 


ggcctttgtc 


tgccttgtgg 


43260 


cccgaactcc 


tgtgaaatgt 


ttcactttga 


43320 


aaaatggtgc 


agtttggact 


aacaggaagg 


43380 


aaaattacag 


taatcaagaa 


attgtggtgt 


43440 


gacactgaga 


aagtgaagga 


agtagcttta 


43500 


ttaaaatcca 


agcttgttga 


gtgcacaatt 


43560 


atactaaagg 


tttcgcatga 


aagggttgtg 


43620 


caggggctgc 


agctacatgc 


accagtaatc 


43680 


ttgtatgtcc 


ttccacacaa 


tgtctggaat 


43740 


gttgaatttt 


aaccatcagg 


ctaaggtcag 


43800 


ttgggtctgg 


tgcctggcgc 


caggctgcct 


43860 


aaaacaggta 


ctgagtataa 


aacagtatag 


43920 


tctcagtgat 


actgccataa 


tgatagacat 


43980 


aaaataagtt 


catacgtagc 


tacttgattt 


44040 


gaaatagtgg 


tctcttcaac 


aaatggtgcc 


44100 


tgaagttaga 


ccactacctc 


acagaatata 


44160 


atttaagagc 


taacattaca 


aaactcttag 


44220 


ctgaattagt 


taatggtata 


ttagctatca 


44280 


catagataaa 


ttgaccaagg 


gaagaaaact 


44340 


caaaaatctt 


caaaaaaaaa 


actagcaaac 


44400 


caacagggtc 


aagtgagatt 


tattacagga 


44460 


atcaatgcaa 


tatatttcat 


taaaataata 


44520 


gatgaaagga 


aatagattgt 


aatggctgat 


44580 


taaatccttt 


caaaatagat 


aattaattcc 


44640 


ctaaaactct 


gtcaggagcc 


aaaggaatgg 


44700 


gtgggggatc 


atggcaggaa 


atgaggctgt 


44760 


gcttaactat 


accttattaa 


agaattttga 


44820 


cagggttttt 


aacaacagtg 


tttcttgatc 


44880 


agaatatttc 


ctgcagcaag 


ttacagccct 


44940 
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tcccccctac cctcaactca aactgctaaa 
cggcttaggc agcataattt ccggttcaca 
5 gctacaagga gacctctctt tacttcactt 

gagttgagta atagcaaagg gaataggatt 
gcaggataat tttgagggag tcaaccagtg 

10 

tagagatagg aaaaagaaga acaaagctag 
aggcaagcca aacaatgagt acctaaacct 
15 atttgaaaga tatttaggag gcttattaat 

aaatcttaga ttaaataaaa ttagtttaat 
tactgctcaa taactattga acatagcttt 

20 

aaactatttt ttaaatcatg tattttcagt 
ttctgctgtg aacttaaaaa caaaatagaa 
25 actgttccca ttagaatggc ttctacagtg 

tggctctccg ctccaatggt tgttctcttc 
aggaatcagg actcaccaga gtccagttaa 

30 

tatgtttgtt gtagttcatt gaaacctctg 
tttggctagg ccccccagaa ctgagtaggt 
35 acattccagg tttacatatt tgttcatttt 

tgtctcagga taaggctttt attttagact 
atctcaggca ctttgccctc ggaaatctca 

40 

acaacttcta tttcttcata atttgcaaag 
attaattgtg aattagcgaa gaactctaat 
45 ctaatgaaat cctttccatt gctctttttt 

tcacaaattg actcttccct ttttaaaagc 
cctgcaatcc ctgtggttga agggatgtag 

50 

aggttggagg agattggctt gaattaagga 
gattttggaa gttggcataa tgttcatgtg 
55 ttttgttttc tcatctgtaa aacagaaata 

gattaaatga caatatatca ttaaaagagc 
tcaataagtt tttattattt attatttatc 

60 

atatatgaag ccagattgaa tttttcttgt 
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tcaatcagga 


atcatctcat 


gatcataaga 


45000 


tccagcagga 


gaaactggcc 


tttctcagaa 


45060 


ggcttaattg 


cctcagatct 


gctaatctct 


45120 


accatgaagg 


attcagacaa 


atcatttgta 


45180 


tcctcagtca 


ttctgggagg 


aaaatagaga 


45240 


agatgaatat 


aattagctgg 


ttgccccaac 


45300 


aaacaatgaa 


gaataaagag 


aagatgataa 


45360 


gtcttgtatg 


ttcgtttcac 


actcaatgtt 


45420 


aaactgctta 


tcccttattt 


ctatttccca 


45480 


aaatactaac 


ctgtatttta 


cttgattagg 


45540 


tttgtgtgta 


tgcgcttata 


cgatataata 


45600 


aaacagaggt 


ttactcaaga 


aaagaatgac 


45660 


gctcccaaaa 


atcctgtagt 


atgagccaga 


45720 


tgatctagaa 


tatgaagtag 


agtacccaag 


45780 


aatcatttta 


agaagaataa 


tttagtttaa 


45840 


gtatctgatg 


agttataggt 


ctgatatgcc 


45900 


actgtcttct 


cagtacactt 


ttaactaacc 


45960 


ttgttgttgt 


tttgtcttac 


tttaactcaa 


46020 


gtaataatga 


tcaatttgct 


atctcaattc 


46080 


ctgtcggaat 


ccttggaaat 


cacaccacgc 


46140 


aaccagtgtg 


actccagtca 


agaacaaaag 


46200 


tttttcaagc 


agccacatgg 


aaaacgcatt 


46260 


cttagttctt 


caataaagca 


tgcgatttga 


46320 


cagtatgggt 


taacacggat 


aaatatagcc 


46380 


gagagtagtg 


ttgaatgccc 


actttatctt 


46440 


actccttcat 


ccaagaggag 


aaaatactct 


46500 


ataatgtcaa 


tactcttgct 


ctctctgtac 


46560 


o 4~ 2a <t f~ /—* t~ \~ ra 




tagaugggag 




ttacaacagc 


tcctaactca 


tagttgtccc 


46680 


agctcttatc 


attgtatcag 


tagcataatg 


46740 


tttccaagaa 


ttggtgacta 


gaacctccct 


46800 
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ctgaaagcag 
atgtattgag 
aatatgacct 
taggagagag 
cgcagtggct 
aggtcaggag 
caaaaattag 
caggagaatc 
actccagcct 
acacagacac 
gtaaagatga 
aatcaggagg 
agagcacctt 
taacaccttg 
cggggaaact 
ggaacatgtt 
agtgcagcat 
gtgtgtggct 
ggacttgctg 
atggtatagg 
aggtaaaaaa 
gggaaatgag 
atatgaaagc 
ttaccaaaga 
tggccagtag 
ctatattctg 
ttatatgtat 
atcataattg 
aaagcctctg 
cagtcttatt 



atgtactgaa 
ttgtgtcccc 
aatttagaag 
tctaatccaa 
cacgcctgta 
ttcaagacta 
ccgggcatag 
acttgaaccc 
gggcgacaga 
agatatgcac 
aggattgggt 
acgtcaggaa 
gattttggac 
ttttaaccac 
aatgcaggtg 
aggtgtttat 
tcgaacggct 
gtttacagga 
acctgatcta 
agcctgcaga 
gaaagatacc 
actggaagcc 
acttcagcaa 
agcactaatg 
aagcttccac 
acatgggagc 
aaaatttgtc 
tataatttat 
ctgtgaacat 
ctgatgttat 



atgaggccac 
ctcaaaaaaa 
tagggtcttt 
tatcactggt 
atcccaacac 
gcctgaccaa 
tggtgcgcac 
aggaggcaga 
gtgagactct 
agagggaaaa 
gatgcatcga 
ggatccttct 
ttccaacctc 
tcgttttaac 
gcttttcact 
gaaagcaggt 
ctactctctg 
agcatcccac 
gaaaagccag 
gagggaccat 
tatcaccact 
tcaggttacc 
aatgctgcca 
cctaggagag 
atacctgcct 
tcatttaatt 
attttatagg 
atgcatattt 
tacatattga 
ctccaggagt 



aaaactactt 
tatattgagg 
aaataaccaa 
ttccttataa 
tttgggaggc 
catggtgaaa 
ctgtaatccc 
gattgtgatg 
catctcaaaa 
cgatgtgaag 
caagccaaga 
cctccaggtt 
cagaacagtg 
cactcgtttt 
ataaggggca 
tatcctctca 
aagctcgggg 
tccccaacga 
agcaaaagcc 
ggaccactcc 
gcctgcaggc 
tgagaattgc 
atggagggtc 
agcttctcag 
aggttaggaa 
tcatttaatt 
catattttaa 
taaatgtgca 
atgtatatgt 
ccaccataca 



aaaatcttga 
tcctaatccc 
tcgaattaaa 
aaagagaaaa 
cgaggcaggt 
tcctgtctct 
acctactcgg 
agactagatc 
caaagaaaag 
acacacaaag 
agtgccaaag 
tctgagggag 
agataataaa 
aacaccttgt 
ctagatttcc 
cctcagcatt 
agagaagaag 
tagtctttgg 
agcaatgtta 
taaaagtcgg 
cccaaccctg 
tgtagcatgg 
agtacgaagg 
aattgagtgg 
tataatccac 
tttttaaaat 
atgtggatat 
tattttgtaa 
ttgggttgtc 
atttctccag 



tgaaaatgtt 
cagtactcag 
atgaagtcat 
ttgggccagg 
ggattgcttg 
actaaaaata 
gaggctgagg 
gcaccattgc 
gggaaattag 
agaaaaccat 
atttctagca 
catgaccctg 
tttctcattt 
tacattagct 
cccaccccat 
ccacagtagg 
cagctgtaaa 
gacccaggtg 
aaaagaaggc 
agcttgcaag 
gaacactaga 
ggagcaaggc 
tgtagccaaa 
ctcttaactg 
agaaagagaa 
aaaggttttt 
tttaatttat 
cataattttt 
tgttgttttt 
agcctctgtt 



46860 
46920 
46980 
47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
48060 
48120 
48180 
48240 
48300 
48360 
48420 
48480 
48540 
48600 
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acatctatct 


tcttccaatc 


catccttgat 


acacttttaa 


agataacagc 


acccaagatc 


48660 


atgttaagta 


ctagagactg 


aacatatatc 


tctacagata 


gtgggatctc 


tcatagttca 


48720 


aactttgtgt 


gggatagttt 


tctaattccc 


cattccatgt 


aataacaaaa 


gaaattaata 


48780 


ctgtatttgc 


cctggttaat 


tatgagacta 


tttctcactt 


tttctacttc 


agcactctgt 


48840 


aattggcctc 


agcctgttct 


gcctacacag 


attaacatta 


tatcctcctt 


cctcctatat 


48900 


tagatgaata 


agtctgttct 


tttattatct 


cattctattc 


tttattattt 


ataatagtaa 


48960 


ttcccattaa 


tacctctgct 


ctttcttttt 


tatatatttt 


ttgtacatat 


gacttctgat 


49020 


ctttttctct 


tacatacaaa 


gtattattat 


aaaaatgtac 


aggcatatct 


aattattatg 


49080 


tatttcagta 


tattcaggtc 


tgacatttta 


tataataaaa 


tggatatttt 


ataataaatt 


49140 


tacttgccct 


ttatattaca 


gttaagcatt 


atattgatct 


cttaaattat 


gtagatctca 


49200 


ttatttgtag 


tttgtatctc 


aggatatgaa 


aaatattaac 


tctacattta 


ttttactgag 


49260 


aaatttttaa 


accatctaga 


ttttgactaa 


aagtcttagt 


acagactttt 


aattccatcc 


49320 


aatgtgaaat 


aactgatatc 


agatatactg 


aaactagcta 


caaaaactgg 


ggaaaatgea 


49380 


taaaacatca 


atttttaggc 


actgtacaaa 


aaccaacaga 


aggtttccat 


tctttggaaa 


49440 


tgggaaacat 


acaaggtgag 


cccccacatt 


cacccagact 


tccccttggg 


aacattttcc 


49500 


aggccatgtg 


gtaggaacgt 


agaacccaaa 


tagttacagt 


ggtcttgctg 


gatggaagca 


49560 


gagattaagt 


ttgaaactgc 


taaagtagct 


ggaattcgat 


gggcatacct 


acagagatag 


49620 


ggaagctact 


gaacagatac 


ctcagaagtc 


tacaaagaaa 


ttttccttag 


gtccttagct 


49680 


gaatcctggc 


catatccttt 


aatacaactt 


cagaaggcct 


aacagaaagt 


aaattcctgg 


49740 


ccgggtgtgg 


tggctcacgc 


ctgtaatcac 


agcactttgt 


aagggaggee 


gaagagggca 


49800 


gatcacttga 


ggtcaggggt 


ttgagaccag 


cctggccaac 


atagtaaaac 


cccatctcta 


49860 


ctaaaaacac 


aaaaattact 


caggcatggt 


ggtgggcgcc 


tgtagtccca 


gctactcagg 


49920 


tggctgaggc 


agaagaatca 


cttgaaacag 


gagacggagg 


ttgcagtgag 


ccgagatcct 


49980 


gccactgcag 


accagcctgg 


gtgacagagc 


aagaccctgt 


cttaaaaaaa 


aaaaaaaaaa 


50040 


gaaaaaaaaa 


aggaagtaaa 


ttccagggag 


actgagagct 


gagcatggat 


tcttgatcct 


50100 


acacagtgtg 


gtgagtggtg 


aagtgcaggc 


gcagccaaaa 


gagagaaacc 


ttcttgatca 


50160 


tccctgacat 


tcaaaggata 


ctccagagat 


gctacagctt 


ctaattaaga 


acgatgaact 


50220 


act anhararr 

<=*yyc*y a. i_ ^y 




y y cl cl L- ciciM cl a 


3 33 ClCICIr^ CI P\ P\ 

cl cl ci y y y cl y cl cl 


CL CL CL v-j C*. C*. v» 


cctaacaaaa 


5 0280 


cctaaaatca 


ggatgcaaga 


tgatctgcaa 


aaatgtaatt 


gctggtgtgc 


ctcctccaag 


50340 


gagaggaaag 


agaatagtac 


ccagatcctc 


acattttgaa 


cagatcatct 


aggaaagaac 


50400 


acttggatac 


agcagagaac 


agacgagaaa 


caccagcagt 


aagaaaggac 


atggtgtgag 


50460 
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gaagcttgcc cagccaggaa ctgaccaaga 
aggaagagag aaacccctag aactccaaaa 

5 

accctgtagt ggattcacga cagcatcagg 
cagagatgtt gccccagaaa gggaacccac 
10 cagcctcagc tgggagccat tttgagagcc 

tgccactgag cagctccaag gagggagagg 
agggtacctg ccgccctgct atgggctgca 

15 

ccccacagct tcttgctcac actgcttgcc 
ccccaaggca ccatcttaag agttgagtgc 
20 aaactaacac aattacagct gctgcctagc 

atgcaaatat aggacaatac ccactgatct 
ggggatgaca ctccccacag cttcttacat 

25 

ccctggccac aggcctaagg tgccattttg 
caggccaaat tcaagttgac atggctgcaa 
30 ccaggctttc cgacgcatac ctaggacaat 

accaacactc aagcgaacca cacttctcac 
gggcctgacc ctctctggtc acaagcccac 

35 

agctgtgccc cacccttggg ccaaggtcaa 
agagagagac aaggaagacc aaggtctcca 
40 gctaaaggca gctgtgggac cagggagtaa 

caccaacaca gatggcatgg gtcactgtgg 
gccagcttcc caggggcctg agactccacc 

45 

acttctaagc aagcagcttg gaggcccaag 
gccagtataa gccactctgg agccttaaaa 
50 tcgggcccaa agactggctc agttggtgta 

tacctgatgg ccaggcacag tatttgggag 
agtttgagac cagcctggac aacatggtga 

55 

gatgggtgtg gtggtgcaca cctgtaattc 
cacttgaacc tggaaggcgg aggctgcagt 
60 tgggtgacag agtgagaccc tgtctaaaaa 
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gccaagagag 


gctcctggac 


ctagggaaac 


50520 


tgggctttta 


tgatcttggc 


tatgggagaa 


50580 


cctggcatat 


ggagcttcct 


aaagattgca 


50640 


acagaatccc 


acaagcttca 


gcacccagag 


50700 


tggataccgg 


gtatctacag 


acacagctgc 


50760 


gaagaacagg 


tgctcccatg 


caccaatgag 


50820 


gttgagactg 


agacataaat 


gagccacgtt 


50880 


tgggagatgc 


cccaccgtct 


ctggtctaaa 


50940 


tggactgtgc 


cctgcccttg 


ggctgaggtc 


51000 


caaggagtga 


cggggaaacc 


aggctatctc 


51060 


gcaacaggct 


gctgtgagac 


tgagacgtga 


51120 


atgcttcttg 


cctaggaggg 


accccaccct 


51180 


agagtttaat 


gctgggctat 


gccccacggt 


51240 


ctactgccca 


gtcaaggagg 


gatagggcag 


51300 


attcaatgcc 


ctgccatagg 


ctgctgcaag 


51360 


agagtcttgc 


ccatactgtt 


cgcctgagaa 


51420 


agctggcact 


atttttagag 


tttaacacta 


51480 


ggtgacacaa 


cagcagtcgc 


cacccaactg 


51540 


aagcacactt 


agaacaatac 


ccactgccct 


51600 


cccaccccaa 


tccattgcag 


cttccagcaa 


51660 


tttcctccac 


cactgctact 


gccacaccac 


51720 


cacacaccag 


gcccactgct 


cccactgccc 


51780 


aatgagccct 


ctaggaccgc 


taacacaagg 


51840 


caggctcgct 


caccccaatg 


ctgccactac 


51900 


ttagtttgtt 


ctcacactgc 


tatagagaaa 


51960 


ggtgaggcag 


acagatcact 


tgaggccagg 


52020 


aaccctgtct 


ctactaaaat 


acaaaaatta 


52080 


cagctacttg 


ggaggctgag 


gcatgagaat 


52140 


gagctgagat 


cgcaacactg 


cactccagcc 


52200 


aaaaaaagaa 


gaaagaaaga 


gagagaggga 


52260 
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qaaaqaqaqa 


aagagagaaa 


ggaaggaagg 


aaggaagaga 

—> w' W^ w-J 


aagaaaaaaa 

wV 


52320 




W» ww ww ww w* WW WW V— *- WW 


aaoaqaaaqq 

WW WW WW WW WW ww —J 


aaaqaaaqaa 


aqaaagaaag 


aaagaaagaa 


agaaagaaag 


52380 


5 


aaagaaagaa 


qqaaaqaaaa 


agaaagaaag 


aaagaaagaa 


agaaagaaag 


aaagaaagaa 


52440 




aoaaaqaaaQ 


aaagaaagaa 


agaaagaaaq 


aaagaaaaag 


aaagagaaag 


aaagaaagaa 


52500 


10 


a o cr a. a. a ci a. c c 

WW W^ <hA l^-A. WW W*« W>« 


tqaqactqqq 


taatt taaaa 


gaaaataggt 


ttaattgect 


ggtggttcca 

* w' — 


52560 


cqcfactQtac 


aqqaaqcatq 

WW ~J —J WW ww "J -w— w— — — 


qcaqcatatg 


ettctgeaga 


ggcctcaggg 


agcttttcct 


52620 




catgacagaa 


ggcaacatgg 


gaacaggcat 


cttacatggt 


aggagcagaa 


ccgaggcggg 


52680 


15 


aaaatqcccc 


acactttfcaa 


acaaccagat 


atcacacaaa 


ctcactcact 


attgtgacac 

w^ 


52740 




aqtaccaaqa 

WW W ww w -w- ^-w ww ^1 J^J 


qqaaaatctq 


ctcctgtgat 
— ^ — ' 


tcaatcgcct 


gtcaccaggc 

~J w^ W# 


cccacctcca 


52800 


« 

20 


acac tqqqqa 
attaacatct 


ttgcaattca 
aaqtccccaq 

w»*. WW "J W W w^ w^ W*' w«» 


acatgagatt 
ctaaaagtca 


tgggcaggga 
ccacaacctt 


cacagaccaa 

wV 

aactgtacca 


aaccacatca 
taagtcacca 


52860 
52920 




aggcaatccc 


agagaccact 


gactttgtgt 


actgegaaag 


aagtcaaaca 


aagatcacac 


52980 


25 


tactaccaac 


atqcaaaatc 


aaagtcgaag 


tatcttaatc 


aacaacacgt 


acacaacctc 


53040 




aoaaaaaaaa 

WW WW W*- WW WW, Vpv*. WW Wj« 


aaaaaattcc 


cctaccaaag 


caatttcaaa 


aaattggaac 


aagcaactgc 

wV w^ 


53100 


30 


tacactagat 
ggatatcat c 


qcaaaqacat 
aaaggaccat 


caatggaaaa 
aattgtccag 


acacaggaag 
caacagattc 

wJ 


catgaaaaag 
caacaaaatg 


cagggaaatg 
aatcccttga 


53160 
53220 




aatgccagat 


aaataattta 


aaatattgat 


ttttaaagaa 


gcttaatgag 


atgcaaagga 


53280 


35 


tatctaaaaa 

W* w*. w- W^ Wj W%- V-«*- WW WW 


tcaqqccaqq 


cataqtqtct 


cqtgcctgta 


atcccactgt 


aatcccacac 


53340 




tttQQQaCTQC 

w w ^ ^ -3 


catqqcaqqt 


agatcacatg 


aggecaggag 


ttcaagacca 

w/ 


gcctgggcac 

w^ w^ w' w^ 


53400 


40 


catactcraaa 
aatcaqtaaa 


CCCtqtCttt 

W^ W^ J w^ w^ 

tcaattcaga 


ctactaaaaa 
aaatgaaaga 


tacaaaaaaa 
aaataaaatc 


aatgtaagct 

w^ — ' 

tcagtacctg 


aatacaagaa 
aagacatgtc 

wr 


53460 
53520 




ttttgaagta 


attcagccag 


acaaaaataa 


ggaaaaaaga 


attacaaagg 


ataaacaaag 


53580 


45 


cctttaaqat 


ate tqaqatq 

ww W» ^— ^ w* WW — | w- w^ 


atataaaaca 


accaaactta 


caaattatca 


gtacacccaa 


53640 




QQQQQt aqaq 


aqattcaaaa 

w/ 


gtttagaaat 


ggccaggtgt 

wJ wJ — ^ w/ — ^ 


ageggctcat 

wV w/ wV 


gcctgtactc 


53700 


50 


ccacjcaattt 

w^ \ iw j w^ w^ 

taacjcaaqaa 


qqqaqqccaa 
aacaagaatc 


aqtqqqaqqa 

tgectttaca 


ctgettgaga 
aatttttttt 


tcaagagttc 
tttaaattag 


aagactagee 
ccaggcatgg 

wW — ' w^ wf 


53760 
53820 




tggctcatgt 


gtgtcatccc 


agctacttga 


agggctaagg 


tgggaggatt 


gcttgagccc 


53880 


55 


aggagtttga 


ggttgcaatg 


ggecatgatt 


gtgccactgc 


actccaacct 


gggtgacaga 


53940 




gtgaaacaaa 


acaaaacaaa 


agcttagaaa 


acatatttga 


ggaaataatc 


aataaaaatg 


54000 


60 


tcccaagaat 
aacacattgc 


atcaagagag 
aaaaaggatc 


ttagacataa 
tcaccatggc 


agatacagga 
aatattatat 


agtccagtga 
tcagaatgtc 


tacccaggca 
taaagtcaat 


54060 
54120 
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gagaaagaaa gaattttgtc agtaacaata 
tccattagac taacagcaga cttttcagca 

5 

ggtattttca aagtgcttga acaaaagaac 
ataggtttca taaataaagt cctttccagg 
10 tacaaatggt caccctacag gaaatgctca 

aatatccatc atcattaaaa cccatggaaa 
acacaaagga ggaaaagaaa ggaatcaaat 

15 

aagacaaaaa gacagaaaca aagaatctat 
acaaagcttc acatatcaat attaaccccg 
20 agatgcagat tggcagaatg ggtttaaaaa 

ctcaccttac ccacaaaaac acatataggt 
agcaaacaga aataaaagca agcaggagta 

25 

aaatcaaaaa cagtaagaaa gaacacagaa 
agatgtaaca atcctaaata tatatgcacc 
30 aatattactg cagaaaacca aataccacat 

gaaaactcat caacacaaag ggaacaatag 
ggacgaggaa gaggagcagt aacataacta 

35 

aaataatctg tacaatgaac ccccatgaca 
gtacccccaa acctaaaaaa acacacacaa 
40 gagaaaggca gcaacacaat aatagtgggg 

agatcatagg gagaaaaaat taacaaagaa 
atggatttaa caggtatttg tgcagtggtg 

45 

tgggttcaag tgattctcct gcctcagcct 
accatgcccg gctacttttt tgtgttttta 
50 gctggtcttg aactactgac ctcagatgat 

tttacaggca tgagtcactg cgcctggccc 
accattcccc aagatagact acatgttagg 

55 

aattgaaatc atgtcaacta tcttctcaga 
caagagaaat ttcagaaagt atacaaatac 
60 tcactgggtc aatgaagaaa ttaagacaga 
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caaaagtttc 


tagtcaccta 


taaagaaaac 


54180 


gaaatcttac 


aggccagaag 


agaagggaat 


54240 


tatcagccaa 


gaattttgtg 


tcctgccaaa 


54300 


caagcaaaca 


ctgagggaat 


ttgtcacccc 


54360 


aagaggtctt 


aaacatgtga 


atgaaaagtc 


54420 


tataaaactt 


acagtttgta 


taaaacaatc 


54480 


ggcaacatga 


caaaatttta 


ctaaaccaca 


54540 


aacttgaaat 


cgatcaacaa 


tatgacagga 


54600 


aaagtaaatg 


gactaaatgc 


tccactgaaa 


54660 


ataatccaag 


tatatgctgc 


ctacaagaaa 


54720 


tgaaagtaaa 


gaggtggaaa 


agatattcca 


54780 


gttatcctta 


tgtcagataa 


aatagacttt 


54840 


gggcattaca 


tagtgataaa 


gtgatcaaga 


54900 


caacattgga 


gcacccaaat 


ttacaaaaca 


54960 


gttctcactt 


ataagtggga 


gctaaatgat 


55020 


acactgagat 


ctatttgagg 


gtgaagggta 


55080 


ttgggtactg 


ggcttaatac 


ctgggtgatg 


55140 


taagtttacc 


tatgtaataa 


accttcatgt 


55200 


atattactac 


acctaaagaa 


agagaaagac 


55260 


gacttaacca 


ccccacttgc 


agcactagac 


55320 


acattggcct 


taaattggac 


tttagaccaa 


55380 


caatcttggc 


tcactgcaac 


ctccacttcc 


55440 


cccaagtagc 


tgggactaca 


ggcatgcacc 


55500 


gtagagacgg 


ggtttcagca 


tgttggtcag 


55560 


ccaccctcct 


cggcctccca 


aaatgctggg 


55620 


agaatatgca 


ttatttttat 


cagcacatgg 


55680 


ccacaaaact 


agccttaaga 


aatttttaaa 


55740 


tgacaatgca 


ataaaggtaa 


aaatcagtac 


55800 


atggaaacta 


aacagcatgc 


tcctgaacaa 


55860 


aatttaaaat 


ttttatgaaa 


tgaaaagaaa 


55920 
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aacacatctt accaaaacct gtgggataca 
gcattaaatg cctacatcaa aaaaagcaga 
5 tctcaaggat gtagagaaac aaggagaaat 

aagatcaagg caggactaaa taaatcagaa 
tgaaaagttt gttccttgaa aagataaaat 

10 

gagagaagat tcaaataaac acaatcagaa 
cagcaataca aaaggctctg acataaggat 
15 ctcactgtga aaaaaaatgg aaaagtgata 

gtattaatga atgagttctg ttatagggaa 
agagacagtt gaaacacact tcaaaattat 

20 

ttatccagca tatgcacgtt acactggata 
cacctatggc ctgccccata cctatgcatc 
25 tggagactgg ctgaagatgc agcatcaata 

tgaggctgca gtgagctatg atcacgttac 
aagacctcca gagtggatca aggggatata 

30 

ggtttggaca ataataatct cccttgtatg 
cttctataaa ctcaattaaa aggtgacaaa 
35 ggtcctgaat gtcccacgtg cactagggct 

tcttattatt tctgtcaata cagtttggtt 
agttacttta tagagttaga caaatgctaa 

40 

tagccataaa atataatgaa aaaggaaaaa 
gaagtacact ggaaagcctc tataattaaa 
45 atagaccaat agaaagtaaa agaaagacca 

ggctgtgaca aaggtggcat ttcaaatgcc 
tgccctgggg cagccaggta gacttttgga 

50 

cattcacaaa aaaatacatt ttacatgtat 
acaagtatta gaagaaaaca cagctaaatt 
55 tcaaacaatg actcaaatgc ataggtaata 

aatttttaaa atttttcttg acagaaaacc 
aattggagaa tataccaaag acaaagacct 

60 

aaattaaggg acaaaggtcc aggaatctga 
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gtgaaagtag 


ttctatgagg 


gaaatttata 


55980 


aagatgacaa 


attaacagct 


taatattgta 


56040 


caaacccgaa 


attggcagaa 


gaaaaaaaaa 


56100 


atgaaaaaag 


aataaaaagg 


atcaacgaaa 


56160 


tgataaacca 


ctagctagac 


taataaaagg 


56220 


ttgaaaatga 


agacattaca 


attgatacca 


56280 


ttgggtgctt 


acttgggaag 


tgacccaaga 


56340 


ccaaaaaaag 


gaagaaagtc 


aatagactat 


56400 


ctggtgctta 


atcctgttgg 


agactctcta 


56460 


ttctcttagg 


ggacagagaa 


gctagggtag 


56520 


ggaaaaattc 


ttgtgggatt 


aacttcccag 


56580 


ctcggaaaag 


aaaggagttg 


tccatagatg 


56640 


cagcatttgc 


tgcagagttt 


gctcagagtt 


56700 


tgtactctag 


cttgagcaag 


accctgtctc 


56760 


gatgctgcat 


caacagcatt 


tgctgaagta 


56820 


tctccggtcc 


cttctagaaa 


gttggcacca 


56880 


tttcatcttc 


tacaatatct 


ggttaaacct 


56940 


tagccaccta 


tatgtcctca 


ttcagatgcc 


57000 


catgttattc 


gtctttacaa 


aaagctattg 


57060 


agttcatttg 


gaaaaacaaa 


catgcaagaa 


57120 


ttacacagga 


gactaaccct 


acaggacatt 


57180 


acattgttga 


aataacacat 


gaatagacaa 


57240 


gaattagaac 


caaatgcata 


tagaaattta 


57300 


gctgaagaga 


agatgagcta 


tttaataaat 


57360 


aaaatttaaa 


attagaccca 


catctcatag 


57420 


taagaatcaa 


aatgtaaaaa 


gtaaaacaat 


57480 


cctgtttaat 


cttagtgtta 


ggaaaggctt 


57540 
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accctaaaca 


aaatcaaaag 


acactgtcaa 


57660 


aaaatcccta 


gtatgtaaag 


aactctttaa 


57720 


tagaaaaaat 


gggtaaagaa 


gtaaaatttg 


57780 
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ggttaggcat 


ggtggctcac 


tcctgtaatc 


cctgcacttt 


gggaggcega 


ggcaggagaa 


5 


tcatttgagg 


ccaggaattg 


aagaccagcc 


tgggcagcat 


agtgaaatcc 


tgtctctaca 


aaaaatttta 


agatttagct 


ggccacagta 


gtgcgtgcct 


gcagtcccag 


ctattcagga 




ggctgaggca 


ggtggattgc 


ttgagcctgg 


gagtttgagg 


ctgcagtgaa 


ctatcactgc 


10 


actccagcct 


gagcaagacc' 


ctgtctctaa 


aaaatacaaa 


caacaacaaa 


aaagtgaagt 




ctgactccag 


ctaccttttc 


atctttttta 


cattcaatgt 


ccttcactgt 


gagccattcc 


15 


caacatatct 


ctaccttctc 


ttccccaaga 


cttttccctt 


tccataccct 


caacggtgtt 


ctttttgttt 


tgaccatctt 


acaatgttct 


agccatccca 


tcaacttcat 


ttgttcttca 
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ara taraart 
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atttttatac 


20 


tgtgcaattc 


acacgatgta 


tgtcatagat 


atgtggaatg 


ccctcccatg 


gatggcttct 




gattttctat 


tggctctgac 


ttggatgaaa 


atagatctaa 


gttgattaca 


agtctcattt 


25 


cactcgctgt 


gattctatcc 


ctttctggac 


tgcagtttat 


tgatttcata 


tgccaccttc 




tgtttgaagt 


cttccctaat 


ctcttcaagt 


ccaacaaatc 


attttctcct 


ctgttctcct 
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30 


atacagcaca 


ccctgggtgc 


ccagttcatt 


tctgtgtctc 


ccagagcaac 


cagcccaacc 




ctttcttctc 


catggagatt 


ataccataga 


ctcttgtcaa 


atcaaaccat 


atttaactgg 


35 


ctcattaacc 


aaaacaggaa 


ataaatacga 


atgaaactga 


gctctaagca 


gcatgtaacc 




tggcctgcat 


ccaggaaata 


gaggacttcg 


gatccttcta 


accctaccac 


ccaactggcc 




in* Vm> d ~j ' — CI V^> d. 1 — t— 
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aaaaaaaaaa 
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caacratcccc 


acacrcaaaaa 
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aaaaaaat acr 


40 


gatcaagaga 


tacgtggctg 


ctggcagagc 


aaggtgagtc 


tgettaatet 


ggtggttggt 




aactgggagt 


tggtgataag 


gtttttctac 


caagtgagac 


ttacggcgaa 


ccagtgtgga 


45 


tgagggatag 


acagagtgta 


ttgtactagc 


tttattgagt 


ctgctccaag 


agaaagaaag 




agaaagatag 


gacatggagg 


gggatgggag 


aaacaaaaag 


aaaaagaatt 


tttaaaacca 
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taaacaoaca 


50 


ctgccttggc 


aatggggctt 


cccttgagaa 


ggataatctc 


catttaacta 


atattgacat 




acctgggacc 


ttggaggatt 


cttgtccact 


ctctcaccct 


gtttcctttt 


ccagaaaatg 


55 


agagaattga 


actcaatgtt 


gtccaagaga 


ttttctagtt 


ctgaactttt 


gctttcagcc 




agtactaaca 


tatgcaaagc 


agcactctaa 


acactatgag 


aggcegggtg 


cggtggttca 




catctgtaat 


cccagcactt 


tgggaggccg 


aggcaagegg 


ategectgag 


gtcaggagtt 


60 


cgagaccagc 


ctggccaaca 


gggggaaacc 


ccatctctac 


taaaaataca 


aaaattagct 
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gggtgtggtS gcgggggcct gtaatcccag 
ttgaacctgg gaggcggagg ttgcagtgag 
5 gtgacagagc cagactctgt ctcaaaaaaa 

aagatggaaa agatggaccc cttgccctct 
ttctcatctg taaaactggg cgggtgagag 

10 

cttaagtgcc ctataactga tatctattat 
ggttaagatg ctccaatgag ctagaagaag 
15 tgtgagggaa aattctctcg catttatttc 

ttccggtggc caattctgtg ttggtggtgg 
gaattatgtc tcacgtgccc ctgtatccaa 

20 

ggaacccacc tagtttggtg tcgaatgtga 
aaggcaaaac cttgggggta agtgagattt 
25 agtgatggca gggggaaggg aaggcctggg 

ggctgagttg atcgcgcctt gccaagcaca 
cacaggcggc ctctgctcag accttgtatt 

30 

gaaaagcaac aaaaaagcag cacgcttgaa 
gccctgtggc ctctgaagaa aggggtccca 
35 cagaatccaa taagatcgaa tcaaatattt 

ccaacaacat aaatggtcac cgcaacccct 
gcccatttca ctgtgtagct gaatgatggg 

40 

caaccctagc tccaccctca gatagtgctc 
gagggaaatt tgtatctgcc aaatttacaa 
45 atcctccaga tatgcctgtg cccacaaaaa 

cgtggtttgt gataacaaaa atattgtaat 
atattggtta agtcagtgta atatactggt 

50 

gtggagctat gaaaagaaca agaaagctct 
atggagttaa tggaagaaag taaggggcag 
55 gaatttaagg aagtaatata tttgtatttg 

agatgttctt tcatttacaa tgggattatg 
tcataagtaa aaaatacatt taatacatgt 

60 

ctaccttaaa caagttcaga acacttaggt 
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atactcagga 


ggctgaggca 


ggagaattgc 


59640 


ccaagatcgt 


accactgcac 


tccagcctgg 


59700 


aattaaaaaa 


taaacactat 


gagagacaca 


59760 


ggtagccttt 


agcctctttg 


agtcttagtt 


59820 


gatgaaagga 


aatgtatttg 


taaagcacct 


59880 


ttgtaacatg 


ggattcataa 


acttgtagag 


59940 


caaaagtcct 


ttttaaagga 


caagtccata 


60000 


ttggcagcat 


gaattcgatg 


acttcagcag 


60060 


caccccacaa 


tggttatcct 


gtgaccccag 


60120 


acagccagcc 


gcaagtccac 


ctagttcctg 


60180 


atgggcagcc 


tgtgcagaaa 


gctctgaaag 


60240 


ccctttgcag 


gccacagact 


gcacagctgg 


60300 


aaatacacaa 


acactacatc 


ccgcaagtgg 


60360 


ggtctattag 


aaagctcagc 


tgcacaacat 


60420 


ttctgattta 


ggcattttcg 


gaaagagaaa 


60480 


ggttgatcct 


gttgccagga 


tccacgccca 


60540 


gcaagaccaa 


aatagaccac 


aaggtggagt 


60600 


gcccaacagg 


aaacaaagca 


ggttaggcaa 


60660 


gagaggctgt 


cctgcgctgg 


acaaagaaag 


60720 


catgaatttg 


gatgcagact 


gcccgagctc 


60780 


atgggaacaa 


aaattgatac 


aacccttagg 


60840 


aatatccaat 


aattccattg 


ctggaacttc 


60900 


ctgtgatatg 


tatgagctta 


ttcattgcag 


60960 


atccatcaat 


ataataccgg 


ttaagtcagt 


61020 


taagtcagtg 


catccaaaga 


ataaaatgta 


61080 


ttgtgtaatg 


atagggacac 


accccaagat 


61140 


acactgtgta 


taatttgctt 


acatttatgt 


61200 
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61260 


tcccagtgta 


cccatcgtaa 


attaaaaata 


61320 


agcctaccaa 


acatcatagc 


ttagcctcgc 


61380 


tagcttacag 


ctgggcaaaa 


tcatctaaca 


61440 



-309- 



WO 02/062946 



PCT/US01/48437 



• 


caaagcctat 


ttaatagtgt 


tcttgaatat 


5 


aatgaaaagc 


agaatgattg 


tatgggtact 




ctttcacacc 


ataataaagt 


tgaaaacttt 




cccaggccgg 


agcgcagtgg 


L.gt- dy ll tuy 


10 


agcgattctc 


ccgcctcagc 


ctcccaagta 




gagctagttt 


ttgtattttt 


gtagagacag 


15 


aactcctgac 


ctgtgatccg 


cccgtctcag 




gccactgtgc 


ctggcccaca 


tggcattttt 
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CCCCaytgLy 
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20 


caaatagagt 


cacattctga 


ggcactcagc 




gggttggggg 


aggggagcat 


aatttaacac 


25 


aggcaaagat 


ttacatagaa 


ggatgtttat 




aaataatctt 


acctgtccaa 


taactctgtc 




a ecu gage ag 


ccccgaaaaa 
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aCatdCgtLC 


30 


catgatgaaa 


tattaagtta 


ataaaattca 




tttgaaaaga 


ataatgggag 


aatacaccaa 


35 


gggagattac 


aggtagtttt 


tatttcattt 




gaaaataaaa 


catttattaa 


aattaaaagg 




ag aaacaaga 
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40 


ctcacatcgg 


cctcggctcc 


ateatggega 




cattctaegg 


aggctttccc 


ttctggggag 


45 


ccgatgagct 


cccagaaggt 


gccaaggcac 




tgccctgcag 


aaggttccca 


gccacccaaa 




cct cgtcccg 


caagcacacc 


agutxgguag 


50 


ttggatggct 


taaacaacag 


aaatttgtta 




atggagatgt 


tagcagatgt 


tttcccctaa 


55 


cttctccctg 


agcctttcca 


cggccttcct 




tgaggacacc 


agtcattttg 


gattagggtc 




eggttttaac 


aaccctatct 


ccaaacacag 


60 


tcaatctacg 


aaatttggtg 


ggggatagag 



ctaatgtagt 


ttactgaaca 


ctggactgaa 


61500 


ecaaataegg 


tttctactga 


attcatgtca 


61560 


gtttttttga 


gacagagtct 


cactctgtca 


61620 


gctcactgca 


acctctgcct 


cctgggttta 


61680 


gctgggatta 


caggcaccca 


ccatcatgcc 


61740 


ggtttcacca 


tgttggccag 


gctggtcttg 


61800 


cttcccaaag 


tgctgagatt 


acaggtgtga 


61860 


tataaggaca 


ccaatcatat 


tggattggag 


61920 


ggctgattat 


atetgeaaca 


actctgtttt 


61980 


attagaattt 


caacacatga 


atttggagga 


62040 


ataacaatat 


cctaaagaaa 


taagagatga 


62100 


cacaatgtta 


catataatag 


aaaaaagtag 


62160 


tcaactactc 


cacagaccat 


ccataatggg 


62220 


caagaatact 


taatgccatg 


aggacacatt 


62280 


gggggccatg 


ttgtgctatc 


aaggtatata 


62340 


aatgtaaaca 


gttgttttct 


ecaggeggag 


62400 


ttgtatttcc 


catattactt 


tttagaccat 


62460 


aggctcatag 


gggaggcagg 


acccagcctc 


62520 


gaeaggecat 


ccagatcatc 


attggcctgg 


62580 


cggttctcgt 


aggggaatac 


ctgtctattt 


62640 


gcttgtgggt 


gagtaactca 


agtcctcctg 


62700 


tcaaggccac 


ettgecaagt 


tgtgcagctg 


62760 


tgtgggggat 


aggaaaccag 


ccagccctgg 


62820 


ggccggccca 


acagagtata 


gcaccacagg 


62880 


tctcacagtt 


ctggcggctg 


aaagtctgcg 


62940 


ggtctctctc 


cttggcttgc 


agaaggtcgt 


63000 


ccatgcctgt 


gccctagtct 


cctcttccta 


63060 


ccccctgatg 


acctcattta 


acctgaatga 


63120 


tcagattctg 


gagcactaca 


gattagggct 


63180 


ggtgtggaca 


caattcagcc 


cataatatca 


63240 
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agctacaacc 


ccgcaaggtg 


acatcccctc 


aaaagggatg 


tattttcttt 


ttactttttt 


63300 


m 

tttttttttt 


ttttttgaga 


tggagtctca 


ctctgtcgcc 


caagctggag 


cgcagtggtg 


63360 


caatctcggc 


tcaccgcaac 


ctccccctcc 


cggattcaag 


caattctcct 


gcctcagcct 


63420 


ccctagtagc 


tgggattaca 


gacgcgcacc 


accacgcccg 


gctaattttt 


gtatttttag 


63480 


tagtgacaag 


atttcaccac 


gttggccagg 


ctggtctcaa 


actcctgacc 


tcaggtgatc 


63540 


cacttgcctc 


agcctcccaa 


agtgctggga 


ttacaggtgt 


gagccaccac 


acccggccaa 


63600 


aagggatgtg 


ttttcttaat 


tcacaaagaa 


aacatccaca 


tgtaaaaaat 


acactgggaa 


63660 


tagcccctgg 


cattgttgga 


cataggccca 


caagcctgag 


ctccaggaga 


aaaacactca 


63720 


gcaaaggtga 


agagtggctt 


cccagctgta 


gaacagcaag 


ggtttgagtg 


cttgagcact 


63780 


ggggactcaa 


tctagatctg 


cagatgggct 


ttgggggctc 


cattttggca 


aaatacctaa 


63840 


tatgctaaat 


gtgtattatt 


tttctggaag 


ggtccatagc 


tttcatcaga 


ttatcagagt 


63900 


gatccgtgac 


tccataaagt 


gttaagaact 


acagtccttc 


ttccacatca 


gtttattacc 


63960 


tttatggttc 


ctttgcagtg 


tccccctgcc 


cacacacaca 


cacacacaca 


cacatacaca 


64020 


cacacacata 


caaacaaaca 


cacatacaca 


cacacacaca 


tacacacaca 


cactctctct 


64080 


ctctctcctt 


ctcctcccct 


ctccaccttc 


ctttctcctt 


tctctccctt 


ctgttacagt 


64140 


cttcctcctg 


actcacacac 


ctacatgtga 


cactaatgca 


actttcctca 


taaatggcac 


64200 


aactccagag 


taacctaggt 


gccagcccaa 


gtccctctac 


cttataagag 


ttaaataaaa 


64260 


tagcctagag 


ctatctccat 


ccccgacagc 


cacacaagtt 


ctgcaaatga 


acagggctcc 


64320 


tctagcgagg 


ctctgcacac 


acccaaactc 


agaactcagc 


ccctttccct 


cccacaacca 


64380 


ctcacaagat 


acctgttcct 


gtatctctga 


ccttcgtgga 


atgtggggct 


atggagaggg 


64440 


ctagcatcac 


ggccacccag 


gcctaccatt 


gagcacacaa 


tgcctaccac 


ggagcactgg 


64500 


attccaccaa 


gtgccagtga 


gcagcaggca 


gttgtcctgg 


catctcaaat 


gcctcactga 


64560 


ccacgcctcc 


tcccaccagg 


ctgctgagtc 


ccttcagccc 


tcatgacctt 


cctccttttg 


64620 


ctctgcaggg 


cagttgtgct 


ctttgctgcc 


ttcttgatcc 


caggactgtc 


tttctatttc 


64680 


ccaaagatca 


tcaatatcct 


cttctgaaac 


aaaggctcat 


ctcctctggc 


acctcagaag 


64740 


gactgagagg 


gataccaaaa 


gtccagccct 


tcctccccat 


caaatctcta 


taaacagaag 


64800 


ggtgtgagcc 


tgagatttgt 


ttttccactc 


ataggagaat 


atggccttgg 


ccttggaagg 


64860 
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tctgccgtgg 


gtgggaggga 


gtatggagtg 


agggttagga 


acatggagtc 


tgaaatcaag 


64980 


ctgtctggat 


ttgattctca 


aatgcatgtg 


ccatagagtg 


gaaacactta 


tttacaatga 


65040 


aaactccgct 


ggatgatgtt 


gtgtgagttg 


ttcaaaatct 


ctaagcctca 


gtctcctcat 


65100 
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tgataaaatg gcaagagtaa tactgcttga 
ttgaacaagt acttcatctg ttggggggag 

5 

cttcttttgg ggccgttgca aatgttctaa 
ataaatatac taaaaataat tgcataatat 
10 aattatatgt caatgaagcc ttttctaaaa 

agctcctatt aaatatagct catgccctct 
tgatggcagg caatcccatc agccacagtg 

15 

cgcaattgac atcatcatcc tcgtatttga 
ctgagtaact aaaagttatt tgtccagata 
20 ctatctaaaa gcagcagaga agacgctggt 

aaacctaact gaatcttgat ataaagattg 
aaaatacaag gaaataatcc aacatttgag 

25 

ccagatgtag ggcataagtt gtctcatgac 
ccattttgta gatgaggaaa taaagactta 
30 gtttctagcc agaatttaaa accaggtttg 

ccagtgcctt ttcaggcagt gcaaaacagc 
tggttccagc aaaggtaccc actgtgtgtg 

35 

ccctagcccc tgacctcttt ctaaacccac 
tctctctccg tggcagcaga aaatcagcca 
40 agaccagctc ttccaactgg agacctatag 

catacgattc ctcccccagc cttgcaccta 
ttcccgctca tacagactgg ccacaggagc 

45 

caagaggatg cagaagtgag aaggggcaat 
ggagagcgag agagagagag agagagagag 
50 gagaggcaca gagctcccag gggcccaggg 

acttccagta cagagaagag gactccgggg 
ctaaggaatc cagccaacag aactctgata 

55 

agcaacagac caagaaatac cttcatttgt 
gaccaggtcc ctacaaccag atgagatcta 
60 gtggaagcgt atctttctgc acatggccca 



tgggccacca 


tgtggttaaa 


tgaggcgagg 


65160 


ggcggtgatg 


ggttgctagt 


cggtacgtgg 


65220 


cttacattgt 


agtgatgatt 


gcacaacttt 


65280 


actttaaatg 


ggtgaattat 


atgatacatg 


65340 


aagcattttc 


tatagtgccc 


cacaaatggt 


65400 


cataccactc 


ttgctagccc 


actgggcaaa 


65460 


cctgcggaat 


accatgttgg 


atgaaatgca 


65520 


tgttcagcat 


catagcgaca 


caatgtttat 


65580 


tgcaggcagg 


caaggattga 


tgtcaggctt 


65640 


aaccccttcc 


ttctcccctc 


cagagccccc 


65700 


gaaaacaata 


gggtgtactt 


gtgggaaaat 


65760 


catctactaa 


gtgtaaagat 


atactaaatg 


65820 


aatgacactt 


gggaagagct 


ataaattttt 


65880 


gagaagttaa 


taacttaccc 


aaggtcaaga 


65940 


ctggatccta 


gaatatatat 


ttatcccacc 


66000 


tcattttttc 


tgttgcccat 


gttgagctga 


66060 


accagaatca 


cagaagcctg 


gggaagggca 


66120 


tctgttctgt 


accagtttat 


catttcagga 


66180 


tattcttatt 


gcctggtaag 


ttacattctg 


66240 


aatggtgtca 


caaaggggaa 


aataaggtca 


66300 


ctataaccag 


acacacggtg 


ctcacacaca 


66360 


ctggccctgg 


agtcacatgt 


ggaaaagtcc 


66420 


acagtcacct 


tagttaaggc 


atcagagtat 


66480 


ataacagctt 


tcctaagaag 


ccctgccaca 


66540 


ctgccctcta 


gcagaccagg 


agccaagagt 


66600 


aagccgaagg 


aggctcagtc 


actcgctttc 


66660 


atcaacatgg 


acaaagcttt 


gtgctcccct 


66720 


ctcataatca 


ggttacgaag 


ggtaaagggt 


66780 


tttgaaagtc 


ttcaaaagaa 


agctcaaaat 


66840 


gaatcatgga 


gcatctgagc 


ttatcacatc 


66900 
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ccaagacact 


tgggctaaga 


ttctatttat 


cctatgggac 


aaataggtca 


agtgtttccc 


66960 


agagattatt 


cttttttctt 


tttctttttc 


tttttttttt 


tttttttttt 


tttgtgtgtg 


67020 


tgtgtgagac 


agagtctcac 


tgtgtccccc 


aggctggaat 


gcagtggttt 


gatctcaact 


67080 


cactgcagca 


tctgcctcct 


gggttcaagc 


aattctcctg 


cctcagcctc 


cctagtagct 


67140 


gggattacag 


gtgcacacca 


ccatgcccag 


ctaattattg 


tattttttgt 


attttttggt 


67200 


attttggtag 


agacggagtt 


ttgccatttt 


ggccaggctg 


gtcttgaact 


cctgacctca 


67260 


ggtgatccac 


ctgcctcagc 


ctcccaaagt 


gctgggatta 


cagatgtgag 


catccgcgtc 


67320 


cagccccaaa 


gattattctt 


gagaacactt 


ctcccatgat 


atgctcaatg 


aagatagtta 


67380 


catgaacttg 


ggaaaatctc 


tatgtattgt 


ttcctctctg 


aaagatctaa 


agtcctcatt 


67440 


aaaggctctg 


agaagtccca 


ccacgatgat 


ctgtttacct 


ttgcccaacc 


cagctatttt 


67500 


caaagctaac 


ttaccccaaa 


acctttcttg 


cacagaacac 


ccctttactt 


gcagtggaaa 


67560 


tagggttcct 


gggattcaca 


ttaagaaaca 


ctttttgatg 


aaaagagtta 


tagagatgga 


67620 


tggtggcaat 


ggttgcacag 


cattgtagat 


gtgtttaatg 


ctactgaact 


gtaacacttg 


67680 


aaaatgatta 


ggatgttaaa 


ttttatgtta 


catatatgtt 


aacacaattt 


tagaaattgg 


67740 


ggggaaaaag 


aaacacttgt 


tgggtctcca 


ttcatctcag 


ctataacagc 


ttttctcgcc 


67800 


catcctgacc 


ctctgtgtct 


tcttcagctg 


tctggcagtt 


tgggcttgaa 


catcgtcagt 


67860 


gcaatctgct 


ctgcagttgg 


agtcatactc 


ttcatcacag 


atctaagtat 


tccccaccca 


67920 


tatgcctacc 


ccgactatta 


tccttacgcc 


tggggtgtgg 


tgagtatccc 


tctcaaccaa 


67980 


agatcctcta 


agttctgaat 


tagctacatt 


tagaaaactc 


ccagaaaggg 


cttgacaacc 


68040 


aagctacctc 


actgaacatg 


gtaggaaaag 


gaggtgcttt 


cagcatagga 


acaaatttta 


68100 


aagtccttga 


cccatagctt 


taagcacatg 


acctttccca 


gtttgggttt 


ctgcctttgt 


68160 


agaacaatac 


ctaccttgtg 


gcaatgttga 


gagaattaaa 


tgcaatttta 


caaagaatgc 


68220 


atatgtgccc 


accaaccagc 


acctctcctc 


acattctctg 


ccttcttgcc 


cattactata 


68280 


agccagaggt 


tagcaaactt 


attctctaaa 


gagccaatag 


taagcagtta 


aggttttgca 


68340 


ggcctacagt 


ctctgtcatg 


tagctatagc 


tgccttacag 


cagccacaga 


aagcactaaa 


68400 


caaatgaata 


tggctgtgtt 


ctgataaaac 


tttatttaca 


aaacaggcag 


caggcaggat 


68460 


ttgacccatg 


gatcataatt 


cacccacctc 


tgctatacac 


tagcactgtc 


caatagaaat 


68520 


atacaaacca 


catgttattt 


taaattt tct 


agttgccaca 


tttaaaagag 


agagagagtg 

— t ~J — * — * — > 


68580 


aaagcaacag 


gtgaaattga 


ttttaataat 


ctattttatt 


taaccccgta 


tatctaaaat 


68640 


attgtcattt 


caacatgtaa 


tcagtataaa 


aacttggttt 


gctattttgc 


atttggggaa 


68700 


ctaaatcttt 


gaaatccagt 


ttatattttg 


tgctatggca 


catctcgatt 


tgggaccagc 


68760 
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tacatctcaa 
tacagatgag 
tcctctccta 
atcagtcttt 
aaaaaacaaa 
tctctggccc 
tcctcttctc 
aaacacctgc 
ttgcttgact 
tctgtttttc 
tttgtctatc 
atacatggct 
ctcaatagct 
ctctactcag 
aaaaccttgg 
gcaatcctgt 
ctctactccc 
tgactgttcc 
gggatccttt 
agccaatgtc 
ctgctgccat 
cactccaggc 
cctcagatgt 
caaccacctt 
cgcttccctt 
gacttatttg 
ggatgttttt 
ggttgacact 
tcagtaaaca 
agagacttaa 



gtgtttaaaa 
ctacctgtgt 
cctgctcaag 
ccccctttgc 
ccaacccacc 
tctttgcaga 
tcatattctc 
tcttctccaa 
tccaggctcc 
ccaatctctt 
tcatttagtc 
ccagccaaat 
catagttcat 
ttttcctgat 
agtcgtcatt 
tggttctact 
acgctggcct 
ctttctccta 
ggaaacacaa 
cttccagcag 
cctctggttc 
atctttccac 
tcccacagct 
cccaatgagg 
ccccttatct 
ttacatttca 
ggtttcactc 
tcataagcat 
tcaacagcca 
agacttatct 



gccacatggg 
cccacgtaaa 
ttctcctccc 
tggggctttt 
tcctcttcgt 
caaattctta 
tctgaacctg 
atccccaatg 
accctcactg 
aggctcaata 
tggcaacttg 
cttctcccct 
acatccaagt 
tccagttgat 
gaccctgtct 
ttcaaaatgc 
gagccaccgc 
ctctgggctc 
gatcccatca 
cctgaagatg 
tgctgctgaa 
ctttgggcct 
tcctctgtcc 
cctacctgaa 
atctttcctg 
tgcttactgc 
actgtcatac 
ttgtcaagga 
ttaactgttg 
ttccaagatt 



aattgtggtt 
ttcaatcctc 
ctctccctca 
gcatcagcat 
tgtacttcct 
aaagaattgt 
ctccaattag 
gggtgattga 
gttgccctct 
attggttctc 
aaacaccttg 
atacaccagt 
ctgatctcca 
ggcaactcca 
ttctctcaca 
attcagaatc 
atctctcatc 
actgcagctg 
cttctctgct 
ctggccctct 
ccacgggaac 
ggaatgttcc 
cttcctcacc 
caacccaaca 
tagtctatgc 
ctgtccctag 
ctcaaatacc 
ataaatgaat 
tttgtatttg 
cttaggcaaa 



gctgaataga 
tgtctgcgca 
tcccgtctct 
taagcatgct 
ttatccacta 
ctacacttta 
actttcagcc 
tccttcttct 
tctctgtctc 
tggttttctt 
tctatgacaa 
ctagcatgtc 
gatttcactt 
tccttccagt 
cctcacagcc 
ccaccatttt 
taatcatgga 
tttatagcat 
ggcatcttac 
ggtgcccccc 
ctccatgatg 
ctccgactgg 
tccttcagat 
cccacactct 
atctaacacc 
aatgagattc 
tagaactgtg 
gtgccagctg 
accagatcct 
gcctctccct 



acagggcagc 
actgagtccc 
cctctacatc 
tttgtttctt 
ccactgtatc 
gtgagtccaa 
tcacttccct 
cggagggact 
ctctgctagt 
tctgtactca 
ttgttctaga 
caatggtcaa 
cccaaacctt 
tgctcgggac 
aatccctcgg 
cccccagccc 
aacagcctcc 
ggccaccaga 
tcagaaaaaa 
gcccctggct 
ttttctccaa 
ggaactcttc 
cactaatctg 
cccagtaacc 
ctctgcaact 
ttaaggacag 
tctggcacac 
tgatgcatac 
tttatcttca 
agagatccta 



68820 
68880 
68940 
69000 
69060 
69120 
69180 
69240 
69300 
69360 
69420 
69480 
69540 
69600 
69660 
69720 
69780 
69840 
69900 
69960 
70020 
70080 
70140 
70200 
70260 
70320 
70380 
70440 
70500 
70560 
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tggaccaagt 


ccatgtcatt 


tatcaaaagt 


accttgagtt 


tctcccctcc 


cagaaacatt 


70620 




ttatggctcc 


cagatacttt 


aggattaaag 


tccaaattcc 


tcagccaggt 


atttgccctc 


70680 


5 


cctgggggat 


ctggctctat 


ctcactgtgc 


cctacagttc 


acctcccttc 


gaatctctca 


70740 




ctcaccctga 


accccagcta 


ctcccgtcca 


ctgcgagaga 


caggcacaag 


cgagtacttg 


70800 


10 


ggaattttcc 


tgctgggaaa 


gaaccaatca 


gggaggcatg 


gactgcccaa 


tagaagaaat 


70860 


taagaagcaa 


atgctggtgg 


aggaactaag 


ggaaaggaag 


atgggagaag 


agtttaggaa 


70920 




agactaaacc 


ccaagcagca 


cagccaggtg 


ccctggagta 


gagggagcag 


caatttttga 


70980 


15 


ttcagaaaac 


ttggctttta 


atcttagttc 


tgccaatgat 


tgatcttgga 


atgagaactc 


71040 




gccccatgcc 


caccacccat 


tccaccttca 


agcctcagtc 


atcccatctg 


aacatggagg 


71100 


20 


gtcggactgc 


atctgagctt 


tgcaaataat 


gacccaagga 


atggagatac 


tccggggttt 


71160 




ccacatcagg 


aattattttt 


catcaaataa 


aaaaaatctc 


agctgcttga 


gggacccctt 


71220 




ttccatgaaa 


tattttatgt 


gtaaaaaaat 


tttatttaaa 


aaaaattgaa 


aaagaccagc 


71280 


25 


atactttgcc 


caggccattc 


tactctaagt 


ctctgtgatc 


cgttgactta 


gacatcctgc 


71340 




actccccttt 


tcctgggaca 


actgcctcta 


ccctcaccct 


ctctaaatcc 


cttcatgaca 


71400 


30 


atgaacctct 


ccctcctcag 


tgctctggga 


tacctttttg 


ctgttctgta 


gctcttaccc 


71460 




tatctgatgt 


gaattataat 


tatcatctgc 


atctgttttc 


ctactagatt 


gtgcctctca 


71520 




ggatgcattt 


cttagccgct 


tctaccaagc 


agatgtttaa 


taaccacttg 


ctaaatggag 


71580 


35 


aggaaggaga 


agaaaagggg 


ctaaggagag 


gcctctccag 


aagagagaat 


tctccaagca 


71640 




tctggctgtc 


accaatgctg 


tttttaaaaa 


tcataattga 


gataattgag 


gccggtcacg 


71700 


40 


gtggctcaca 


cctgtaatcc 


cagcactttg 


ggagaccgag 


gcaggtggat 


cacttgagct 


71760 




caggagttca 


agaccagcct 


gggcaacatg 


gcgaaatccc 


atctatacaa 


aatatacaaa 


71820 




aattagccag 


gcatggtagc 


gcgtgcctgt 


aatcccagct 


actcgggagg 


ctgaggcagg 


71880 


45 


agaatcgctt 


caacccagga 


ggcggagctc 


acaatgctga 


gctcatgcca 


ctgctctcca 


71940 




acctgagtgg 


cagagcaaca 


ctctgtctca 


aaaaaaaaaa 


aaaattaaaa 


aatcataatt 


72000 


50 


gagaggcatt 


tctttggtac 


ctactgcatg 


ccatccacct 


tcactatgaa 


atgttctcat 


72060 




gtcacataat 


tggaacccaa 


atggtaatgg 


gagtctcctg 


gatggacctt 


ttagattgaa 


72120 




actcgcttcc 


cagatacact 


ggaagactcc 


atactcctgg 


gtattcttgg 


gtaacagcaa 


72180 


55 


atQQacfQQtt 


tctatttctc 


ccqqttacct 


cctcrctatqq 


t taatttact 


tcattttqqq 


72240 




gatcctttca 


ctgactgttc 


ttcccaagcc 


agagaaacag 


ccccaattcc 


tgccctccac 


72300 


60 


aaaacaccac 


tgtcacaagc 


caagaaagtc 


ttttcagagg 


ctgacatttt 


cccaaaggtt 


72360 




cagatgaagg 


ggtgggttgc 


cccttcacac 


ctgtgggtgt 


ttctcgttag 


gtggaacaag 


72420 
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agacttggaa aagaaagaga cacagagata 
ggaccggcgt tcagcataca gaggatccac 

5 

tttattgatc attattgggc atgggtaata 
aacacgtgaa caaatgtctc tgcatcataa 
10 gatgtgcata tacataaaca tctcaatgcc 

cacctccagc cctaaggcag ttttccccta 
acaccgagac attccattgc ccagagacga 

15 

actgcaaaga gccgttcctt cctcttttac 
tgttgggctg ggggacggtc aggtctttcc 
20 catggggaga aaccttggaa aatacctggc 

gcagtgtttt gtgtctctgg gtacttgaga 
gatgactctt tttttttttt tttttttttt 

25 

ggccggactg cggactgcag tggcgcaatc 
tcatgccatt ctcctgcctc agcctcccga 
30 gcccggctaa ttttttgtat ttttagtaga 

tctcgatctc ctgacctcat gatccacccg 
gcgtgagcca ccgcgcccgg ccggtgatga 

35 

tgtttaacaa agcacaccct gcacaaccct 
catgtttcag ggagcacagg gttgggggta 
40 aagaattttt cttagtacag aacaaaatgg 

acacagtaac aatctgatct ctctttcttt 
acatcagagc ccagcaaacc actgattact 

45 

ttcatttttc acaggaggcc ttttcctgtg 
ccttcttttt cctctaacca tctccctgtt 
50 cctaccgacg gcctccccag agcagtgtca 

ctcaggtctc cacacccaaa gcaacctatc 
ttctcagagg aacactacca agtaaatgtt 

55 

atacttacaa aaacccggtg ggaaatttcc 
tccgcaaagc cacaaagagt ccatggcagg 
60 gtcccgtgct cctgtcctcc ccatctgctc 
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aagtacagag 


aaagaaaaag 


ggggcctagg 


72480 


gtgggcactg 


gcctctgagt 


tcccttagta 


72540 


ggataatagt 


ggagagaagg 


tcagaaggta 


72600 


acaaggtaaa 


gaaaaaagtg 


ctgtgctttt 


72660 


ttaaggagca 


gtattgctgc 


cagcatgtcc 


72720 


tctcagtaaa 


tggaatatac 


aatggacttt 


72780 


gcaggagaca 


gaagctttcc 


tcttacctca 


72840 


taatcctcct 


cagcacagac 


cctttacggg 


72900 


catcccatga 


ggccatattt 


cagactatca 


72960 


tttcctaggc 


agaggtctct 


gtggccttcc 


73020 


ttagggagtg 


gtttgagatt 


agggagtggt 


73080 


ttttgagacg 


gagtctcgct 


ctgtcgccca 


73140 


tcggctcact 


gcaagctcca 


cttcccgggt 


73200 


gtagctggga 


ctacaggcac 


ccgccaccgc 


73260 


gacagggttt 


caccttgtta 


gccaggatgg 


73320 


cctcggcctc 


ccaaagtgct 


gggattacag 


73380 


ctcttaacaa 


gcgtgctgcc 


ttcaagcatt 


73440 


taatccatct 


aaccctgact 


tgacacagca 


73500 


gggttacaga 


ttaacagcag 


ctcgaggcag 


73560 


agtctcctat 


gtctacttcc 


ttctacacag 


73620 


tccccacatt 


cagagggatg 


gtggaggttt 


73680 


gaagataaga 


catccttgct 


ttgggaagtc 


73740 


gcctccccct 


tcccaccatg 


gcgctcaact 


73800 


ctctctgtgt 


cttgtccata 


ccctccaatt 


73860 


ctgctcccct 


ctctgactcc 


agagcacacc 


73920 


aaggggcttc 


cctaaacatg 


gtcatctttt 


73980 


gcaataacaa 


ttcttttacc 


cccatacttg 


74040 


ccccacgagg 


ctgagagagg 


ataggggact 


74100 


gccccagtga 


gaggtcagtt 


cggactccca 


74160 


tgcctacaga 


accctggaat 


ggcgatttct 


74220 
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ggcgtgctgc tggtcttctg cctcctggag 
ggctgccagt tggtctgctg tcaatcaagc 
5 gaaagatgcc cccaaaaagg tggagacaag 

aggagcagga gtagttcctc ggtgtccgag 
atgccccctg tgtgcctgtg ttttccaggt 

10 

aaacccagtg atcaccccag aaccggtgac 
agcaaataag taaggctaca gattctggaa 
15 cctccctttc tgggcttcca taacccaggt 

tctcccactg tttgtactct caccttcatt 
tctctatcaa gaagaagaca gagattttaa 

20 

gcacatgcat cagcacatat gtgggcatcc 
gtgtgctctg ctgcatgtga gcttgtgggt 
25 tcactctttc aattgtgcat tcatttaata 

gctctcttca tggcaggagg gggctggaca 
acttctggcc ttaggagaga ggacaccagg 

30 

ggcaacatag gggagaggca gaaggagccc 
aaggaaggaa tgaggagagc agaggacagg 
35 cattaaagga cagtaaggat tttccgtgta 

agagagaatg atcagtcctt ttagattctg 
tgcaagaatg tagaaggaat cttccagaga 

40 

ggactaattc caaagagaaa tggccatgtc 
gtaacagcat ctaaaaggac taatcctttc 
45 ccaaacagcg tcgtccctca tactgaaaat 

tgtcgcttcc tgaaacacgc aaaatttaaa 
caagaagttg cattctgtgg ggttatagca 

50 

tttccttagc tacagaagca ctgcaaggat 
aatacaaaat gtgattttgc tcttgtaaat 
55 tcttctaaat ggcatgcagt tagaggtcta 

tctgatcagg aatccttctg atgcccatca 
ctagaactaa atgggcctta ggaattacag 

60 

aatatgaggc atgtaatatc ctggttctag 
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tttqqcatcq 


catgcgcatc 


ttcccacttt 


74280 


aatqtqaatc 


ccagggttcc 


tcaqtqqqtq 


74340 


qqaqctcttt 


gtgctttcca 


ctgcctgctc 


74400 


gcctttggtg 


catggcccgt 


ccttcttagc 


74460 


qagtgtcatc 
—j -j —j 


tatccaaaca 


tctatgcagc 


74520 


ctcaccacca 


agttattcca 


gtgagatcca 


74580 


qcatctttca 


ctgggaccaa 


aagaagtcct 


74640 


cgttcctgtt 


ctqacagctg 


aggaaacgtc 


74700 


cttcaattca 


gtctaggaaa 


ccatgctgtt 


74760 


acagatgtta 


accaagaggg 


actccctagg 


74820 


agcctctggg 


gccttggcac 


acacacattc 


74880 


tagaggaaca 


aatatctaga 


cattcaatct 


74940 


aatagatact 


gagcattcaa 


tqtgttcaag 


75000 


ttgacatgac 


acagtcttgg 


acctcaagaa 


75060 


acatcacqqg 


agcaattttc 


actgacaaag 


75120 


agagtgaatt 


tattctggct 


gtgatgcaga 


75180 


tctqqttqqq 


gtagcaacat 


ctqaatqqqq 


75240 


ataaqqqacc 


acqtqqttca 


qqaqqCCttC 

3 3 3 3 


75300 


ttaccaatga 


ggcaaccaaa 


ttcacgaatg 


75360 


cataaggcca 


atacatcatt 


atctgtaaaa 


. 75420 


attcctatta 


ccatttctgc 


atttaccgtg 


75480 


agcagacgac 
~j ~j -j 


tcactgtctc 


caaaggtctc 


75540 


ccctcactgt 


cctttaatgg 


cccatttaga 
— > 


75600 


cattgttatg 


tatgataacc 


aatgggcatc 


75660 


atcatctcta 


aaatccagct 


gttgtcaagc 


75720 


ttgtttgttt 


gttgtttgtt 


ttggttacta 


75780 


aactttgttc 


ttggggttat 
> — > — > — > 


ctaagaagtt 


75840 


tcgcctgtgc 


acctgcatct 


tctcattttc 


75900 


gctagccagg 


aacaggagtg 


ttttcatttc 


75960 


acctcctgaa 


aaattgattt 


tttcaaatgg 


76020 


atcagtcctt 


cacatacctg 


ttttgtaagc 


76080 
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caggaggtta 
ggctcggtaa 
gctgccccat 
accctccatt 
atattgctaa 
tacatctggt 
gcttgttgga 
cccttgtcct 
tgcattctct 
actccataac 
ctatttaacc 
tgcaatttaa 
cataaggcct 
cacaggagtg 
gtgctcattt 
catgctcttg 
ttacccagtc 
gaaatttgcc 
cagatggccg 
tgactaagat 
gtgccacttc 
atgactgctg 
agcccggatc 
caagtgagtg 
ggcagctcca 
tgccttaagt 
gaacacttag 
tctggctcag 
tccggcaaac 
agcttggttg 



ggacattgag 
acagagcaag 
ctaactccac 
ctgttcacag 
catagctcgc 
gcctctccat 
acagattctt 
ggggggacca 
gcaccacccc 
aatggaacat 
atagcactgt 
ttgccattgt 
ctgccctcat 
ggcttctgat 
gcccttcagc 
gacttcccag 
tctggtattc 
caaggtcatc 
gttcagggct 
taaaacactg 
accagccaga 
agctggctcc 
tcacacccac 
agagtggggt 
ggggccagca 
ggcttccacc 
agatgccagc 
ggagtaccaa 
gagagcatgt 
ttttacagct 



ctccccaaaa 
cctgcccctc 
acctccttct 
agcctccttc 
ttcctgcagt 
atgggggcac 
tcctatctta 
attgccaacc 
acaagccctc 
aaaacccgga 
ggcagtgcaa 
aatggtattt 
gaatggagta 
agaaggataa 
catgttatga 
cctccagaac 
tgttatagca 
aacccagcta 
ccttgcagaa 
tcttcacttc 
gacttccgca 
tcccacttgg 
catgggcaag 
ctggccatgg 
caggcacaag 
ttgagcactg 
aactgcagag 
gatctgggct 
caccacccat 
tgtttggtcc 



gggacccacc 
tcaccacctc 
catcctacag 
ctgacatttc 
gtgggtcatc 
ctgaccctgc 
tcacctacta 
tagggtgaca 
cacccacagg 
agagatctca 
tggtttaaat 
aaaagtggag 
atgccattat 
attttggtcc 
tgcatcaggg 
tgtgagaaat 
atgtaaaaca 
gaggtatagc 
ggaggagcat 
cttagcttga 
gccagcagca 
cccagcaggc 
ttaggcatgg 
tacacctgtg 
ctctgtgcaa 
gcatttggac 
ccccaaatgg 
cccataaggg 
agctcggcaa 
tgctgccctg 



tgtttgcacc 
ctttcttatt 
aactctaccc 
tgacctatga 
ccagagaaag 
cctcctgaaa 
tattagtcct 
gctcattgca 
gctccatccc 
gagatctgta 
ctcccctcca 
tctttaaaag 
tttgggagtg 
ttgtcctctc 
ccctcagcag 
aaatttattt 
gactaagaca 
caggtccaga 
gtggtaaaaa 
aatgacacag 
cctctgctag 
tgcgctcagc 
agcagaaagg 
ctggctgctg 
ggctatggct 
aaggggaaca 
gtgttacagc 
tcacagctct 
tctggccaga 
ctccagccca 



gttctctgag 
gctgcctctt 
ctcaaacctg 
gttaattcag 
cactttctaa 
tcaccacagg 
gtgttctagg 
taaaatgtta 
ctccctatcc 
gcccacccaa 
aaactcatgt 
atgattagat 
ggttagttat 
tccgcctcat 
atactggcac 
tctttatgaa 
ggtagagatg 
accaggactt 
aaaatcaggg 
gatccttcgg 
agttttgctc 
tcatgctacc 
ggtgtgtgag 
tggtggggca 
ggacaaggca 
catggtgcct 
atgtcacagc 
tctctcatag 
aacgtgtttc 
cagttcctgg 



76140 
76200 
76260 
76320 
76380 
76440 
76500 
76560 
76620 
76680 
76740 
76800 
76860 
76920 
76980 
77040 
77100 
77160 
77220 
77280 
77340 
77400 
77460 
77520 
77580 
77640 
77700 
77760 
77820 
77880 
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gctggcctgg ccctgccctg cttcccatca 
gcaggaggga tacaatgtta cagctccttt 
5 ttgtcctgtg tccaggaaga atgaggttac 

aagagtttta ctgagcaata aaacagctct 
cctacccaga ggtgggtagt cccacgtggc 

10 

cccagactta ctgaattagc actctgggga 
ctgctgggtg attctgatgc acactctcag 
15 tcaaatgact gtttttctca ttttgagtac 

tgtttacaga ggcagcccat gccatctgta 
atgaactatg tcaaaacgga atgaattaca 

20 

aaaagatgac ccttcacctt tagacttgcg 
agcccaacca cttgctgaca gtctctggta 
25 ccttctcccc cttctcaccc cagcaaggct 

agttaaatca acaaccagga gtcaatacaa 
aaagcagcaa cattcgactt cacccaggta 

30 

ccagtgggcc aatccacaga ctgccagaag 
ctactgaaca ctgccagcgt gctcgcacgg 
35 atgaacacaa tcttatgaaa atggaagtaa 

aacttattct gttcaatgcc ctttgataaa 
gagtttaact gagcaatgaa caattcacaa 

40 

ctcagagact ccatgagtgc ctcgtgttca 
atgtactgga attggaagtg aggtacagaa 
45 ccttagttga aggcagcttg aatagtcagt 

ggattggcca atgctcagct attgttactg 
tgtctgacta ttaagctagg ttacagttca 

50 

gtccttctca ggccatattt agtttgcttt 
tattgcaaaa atacagagtg gcattatcta 
55 ggggctcaga gaggttaagt aatttgccca 

ggtcttaatt catgtctcct cactccaaat 
ctctcacagt tctggaggct gggaagccca 

60 

gtgaggggct gcttcctggt tcgtagacag 
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tgcagagcag 


ctgcccagcg 


tcagtagaag 


77940 


cgcagctgcc 


attccacggg 


tgctgggttt 


78000 


atggacaacc 


agagagtgag 


caaggcagag 


78060 


tagtggagaa 


gggccccaaa 


gcaggtagcc 


78120 


tgagtccaga 


ggttttatgg 


gctttacgat 


78180 


caggatccag 


caatctgttt 


ttgaacaagc 


78240 


gtttgagaac 


cactgctcca 


cagcatcttc 


78300 


cttctaacgt 


gatggcatac 


attatgatgg 


78360 


gacaattgta 


ctgagaggct 


cttcttaaga 


78420 


tcaagttttc 


atgtggaaga 


ttaaaaaaaa 


78480 


gaatcagagt 


attagataag 


actctgtctc 


78540 


ggttacgtcc 


cccatgttcc 


ctttggtcaa 


78600 


agtatcaggt 


acacttttta 


tgtctcataa 


78660 


actatataga 


tacagagggc 


aataaaggca 


78720 


agagggtaat 


aaaattccat 


aaaggacaac 


78780 


catcaggcaa 


ctgagcactc 


cccacctcca 


78840 


gactggtagt 


gctcaataaa 


tgtctgtcaa 


78900 


ttaacattta 


ttgagtcttt 


aatgtctgac 


78960 


agaaaaacct 


cagccgaatt 


aaatttaaag 


79020 


atcgggaagc 


ccccagaatc 


acagcagatt 


79080 


gaacaaattt 


atacacaaaa 


aggaaacaca 


79140 


acagtgagat 


tggttatagc 


tccgcatttg 


79200 


ggtctatgaa 


tggttgaagt 


atggccgctg 


79260 


atgcatacta 


ttaagttagg 


ttttcgattt 


79320 


tccacaagga 


ctcaaatata 


gaagtacaga 


79380 


aacacctttc 


agctgcatga 


tctcattttg 


79440 


tgctttagga 


aagagaaata 


tgaaaaatct 


79500 




s — j V— C*. C-*. d \* 


aataoaacta 


79560 


cccatgctct 


tttgccactg 


caaaatgtat 


79620 


agaccaaagt 


accagcagat 


ttggtgtctg 


79680 


cagtgctctc 


actatgtcct 


cacatggcag 


79740 
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^ggggccaag 

gcccctcatg 
ctaggtttta 
aaaaagctta 
cttgagttcg 
ggatgtattt 
ggtagaagcc 
gaattgtcta 
tatctgaaaa 
acttcaacca 
ctcttccagt 
gccacaaaat 
cctttcccta 
cccaaccctt 
gtgagctatg 
aaaaaaaaaa 
tccttatctc 
tcatcccagt 
cacgtcctca 
atttccttat 
tctcttgtta 
gaaaaggtat 
tttttcattt 
aaatatttta 
aatatggtat 
aataactttt 
gtttatgctg 
attcctgacc 
gtccaaatcc 
caggcatcca 



ggagctctct 
acgtaatcac 
acataggaat 
ttgatacagc 
gtgtatcagg 
tttgttgcca 
aaggataatg 
gcccaaagtg 
ggctattaaa 
aaacaacata 
aatccaaaca 
gtagtgtcct 
ccacttcttc 
tgggaggctg 
attgtatcac 
aaaaaagtag 
tgaataaatg 
ttattaccat 
tcaaacctag 
gaagaccatc 
atctgtcttt 
ttctcctctc 
tggaaaatag 
ttttaaaata 
aaatatcaac 
aaaaatgcaa 
ttattctcct 
atggcattca 
cagttgtcca 
ttcactcatt 



ggagtctttt 
ctcccaaaaa 
tttaggatga 
ttcagattcc 
ggttcttaag 
caactggagt 
ctgaacatcc 
gcattaatgt 
atactcactt 
tcaaaacaga 
ggaaagaggt 
agcaacggtg 
cctgaagcaa 
aggcaggagg 
tacattacag 
aaaagaaaag 
tggttacaaa 
taagtcatgg 
cataaaaagt 
acatcaagca 
tgtcaaaagg 
ctatgaacct 
aattattttt 
gattagtaaa 
aaatttaacc 
aggactcctg 
cctcaaagcc 
agaccctgtt 
gcctctctgt 
catcaaatgc 



attatataaa 
acccacctcc 
caaaaacgtt 
acactgcaag 
aagcgggaag 
tgagggaaag 
tacaatgcac 
tgagggctaa 
ttctaactac 
caatatataa 
atacatacct 
aaaacagcag 
ctcataagac 
atcacctgag 
cctagatgac 
tgccctccct 
atgatattaa 
ttttttattt 
acacaagttt 
aaacttatat 
cccttggcca 
ataaaacaat 
tatttaaaaa 
tcatttttaa 
cacataaaca 
agaccaaaga 
ctgcaagggt 
cagctgccct 
caatggctcc 
caagcacctg 



gccactaatc 
aaatatcatt 
cagtctatag 
taacctttcg 
aggtgaattt 
ggtgctattg 
aggacagccc 
gaaagaagac 
cgatctgatt 
gcagatgtga 
gatatggttc 
aaacaggaaa 
cctaattcga 
gccaggagtt 
agaatgggag 
acaccgagag 
acaaataggc 
aatcatactt 
atctgtttct 
taaataaatg 
taaatgtagt 
accactcttc 
tatttatgtt 
gtatctccat 
agtgttcttc 
gtgtgggtcc 
tcctagcatc 
tcaatccctc 
taccttgaac 
cctctgcagg 



ttattcatga 
acattgagga 
cacagagtac 
ggaactaaca 
ggtaacgtct 
gcatctagtg 
cttcatcaaa 
taccagtaat 
ttcttcatat 
aaatctagct 
aggtcacaca 
gtcatgctca 
aaggtaccct 
cgaggctgca 
ctcaaagaca 
gaaaggaaca 
cttgctaatt 
cacaattatt 
tcaggtcttt 
tgtatgcttt 
gatggttgaa 
ccactaattt 
aatatgtaat 
tttaatttct 
gagagtactt 
tgagaccaga 
cacagaataa 
ttcccacaat 
tcaccaaatt 
taagaaccac 



79800 
79860 
79920 
79980 
80040 
80100 
80160 
80220 
80280 
80340 
80400 
80460 
80520 
80580 
80640 
80700 
80760 
80820 
80880 
80940 
81000 
81060 
81120 
81180 
81240 
81300 
81360 
81420 
81480 
81540 
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atcccacccc 


ttaggagctc 


agaagctcat 


aaggaaggat 


acacacatac 


tcggataaca 


81600 


gtggggcaag 


gatgcaggag 


aggttccgac 


aaaacgctat 


gggagcctgt 


aatcccagca 


81660 


ctttgggagg 


ccaaggcagg 


tggatagtct 


taggtcagtc 


tggccgacat 


ggtgaaacct 


81720 


ggtctctacc 


aaaagtaaaa 


aatgggtctg 


gtgtggtggc 


agccacctgt 


aatcccagct 


81780 


aacctgtaat 


cccagctacc 


tgggaggctc 


aggcaggaga 


atcgtttgaa 


cccagaaggc 


81840 


ggaggttgca 


gtgaaccgag 


attgtgccac 


tgcactccag 


cctgggtgac 


atgagggtga 


81900 


atgagggcga 


atgaagctcc 


ttctcaaaaa 


aaaaaaggta 


tgagagtttg 


gagtaggatg 


81960 


ctgtcaaggg 


acaaagcact 


tatttttttc 


aagtggcggc 


aggtttcctt 


aggaaataac 


82020 


tgtttttggc 


tcttttgggg 


tatgggaaat 


gaagccctcc 


tcccatccca 


agaaatgacc 


82080 


aggttgggga 


gggtaaaaca 


gagaattgtt 


ctgcacaata 


ttgcaggaag 


catctggaaa 


82140 


ttctgagggc 


agctctgcag 


gggccggagg 


acgaggctct 


aggatgggaa 


gaaaaggaaa 


82200 


gagtggagag 


gaagaggatg 


gggcctagaa 


gcatggagga 


aggccagggt 


cctggagaag 


82260 


aaagggccca 


gctgaagttc 


tctgtgacag 


gaacgagcaa 


gggactaaga 


tcagagacat 


82320 


ccagcacttg 


atgtttagtt 


cattgctctg 


ttttggggca 


ttggcatctt 


tcccaccccc 


82380 


tacactttag 


taaaattccg 


ctacacagaa 


ccactttgtg 


tgagtaaagc 


tgtaggaaaa 


82440 


tcaaggaagg 


gagagttttc 


tagtgctgcc 


ataacaaagg 


actacaaact 


gggtggcttg 


82500 


ttaattcaac 


gtatcgtctc 


acagatctgg 


agactggaag 


tctgagatca 


aggtgtcagc 


82560 


agggtgggct 


ccttctgggg 


ctgtgaagga 


ctccattcca 


gacctccctg 


caggcttctg 


82620 


gtggtccgtg 


ggcatcttgg 


ccactccttg 


gcctatagat 


gcatcactcc 


agagtctgcc 


82680 


ttcatgttca 


tgcaggcttc 


ttcctgtgtg 


caagtctgtg 


caatatcccc 


tttttacaaa 


82740 


aacaccagtc 


atagtaaatt 


agggcccctt 


ttaacctaat 


tacttttgcc 


atgaaactat 


82800 


ctccgaataa 


agtcacattc 


tgagatacta 


gatgaggggt 


taagacttca 


gtatatcttt 


82860 


ttggagggtt 


ccatttcaac 


ccacagcagg 


agatgtgttc 


caggactgag 


ttcatgtggg 


82920 


gtggacaaac 


agccgtaaaa 


ggggagatgt 


gagggagatc 


aggagtgtat 


aggaaattga 


82980 


gcaaggaaat 


tgactatgta 


aagcgtaatc 


tacatgaaga 


aatcctccaa 


cttattggga 


83040 


atggagctac 


tatatatagg 


atggcggctt 


tagccagttc 


atgtaaacta 


aaggaaaggg 


83100 


tgattctata 


aggctggaaa 


aattggagac 


acacacacac 


acaaacatca 


caggcagctc 


83160 


acatttQQQQ 

—J 


CatQtQQQtt 


ccaaagagat 


ttaqtttqac 


tgtaaagttc 


aqqqaaqqct 


83220 


ttgtggagga 


agaggcttca 


ggatgagcgt 


tggaggatgg 


ggtgaatctg 


gacatgctcc 


83280 


cacacctctg 


gttcacactc 


ttctggaata 


cactgtgcct 


tcagtgcagc 


tttgatccag 


83340 


tctttctttc 


aatgtccaac 


ttgagttaga 


tgtccaattc 


catgtccagc 


cactcgacaa 


83400 
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acctcctagg accagcccat ccctttgaca 
tgactgaata tttgtttgtt cctatccata 

5 

tattctgtct cgtcaactcc ttgcaaattc 
agcacacacc agacactcaa taaatgctca 
10 cagaaaatct tgacccactt ctggatttct 

atcccatgga atcctacttt agccaaggcc 
tggatgccag agaatagacg tattgactcc 

15 

tgaaagggaa acctaatttg agtttatttt 
catcagctag caccagaatt tctagatcag 
20 agcatccaaa gacttcccag gagcacaagg 

ctgacagagc ctggatagca ccagcagaga 
tcttgagaac cacaaggaat caggggttca 

25 

gcgtgcagcc acccatcctt ggaggtgctg 
caggttctca aattgagctc ctgtctctgc 
30 ggagactctg ctaggcattt tctggctgcc 

gtgatcaggt ctacttagga cctcaatcta 
ctcttttttt tcttcttctt ctttttgaga 

35 

tgcagtggcg cgatctcggc tcactgcaag 
gcctcagctt ccagagtagc tgggactaca 
40 tgtatttttt agtagagaca gggtttcacc 

cctcgtgatc tgcccgcctt ggcctcccaa 
tttgttcctg acattttcct gacgggaggt 

45 

atagggccca actgtttgtg ccaattgtca 
aaagaaggag tcctctgaaa aaaaggaaga 
50 gcagagtata acatttagca cagtattcat 

ttcttgtatt tcaaatctca acttcagcat 
ttaacctctc tgagcctcag agagtttgcc 

55 

tgggggaact gccttatatc tacagggtta 
tgttattcta aaccactctc ctgggaaatc 
60 aggtctaatg cagaagtggc tcacactgcc 
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catctcccag 


ccacatcatc 


cctgatggtc 


83460 


cttgcactag 


agttatccca 


acagtggtta 


83520 


tagagcacct 


cgtatcctag 


cataccacct 


83580 


ctaattgctt 


ataagggaaa 


tctattggct 


83640 


cactttctca 


aagagaaact 


ttggaaggag 


83700 


ataaatactc 


tgaggggcaa 


ataaacagcg 


83760 


aaatgaaact 


gcaaagagga 


ctttggagct 


83820 


ctagatgatt 


aacattaaat 


agtgccaatc 


83880 


gctgcagaaa 


cccacagagc 


ctgaagaaag 


83940 


taagacttct 


gtgaattctt 


ccgcaagact 


84000 


cgtggaccta 


ggcggtccct 


cggaagtgtc 


84060 


gacatagggt 


agtgccctcc 


agcaagacag 


84120 


ggaggctggg 


aagtaactga 


agagtcacaa 


84180 


atggacacag 


caatcttaag 


atagaaggct 


84240 


ttgctcccta 


agcccagctg 


taggcacact 


84300 


gaactggtgg 


aatggcttct 


gatggaacca 


84360 


cggagtctcg 


ctctgtcacc 


caggctggag 


84420 


ttccacctcc 


cggtttcagg 


ccattctcct 


84480 


ggcgcccgcc 


accacgcccg 


gctaattttt 


84540 


gtgttaacca 


ggatggtctc 


gatctcctga 


84600 


agtgctggga 


ttacaggcag 


aaccattctc 


84660 


agaatctcag 


gggtcccagg 


aacatccctc 


84720 


ggtctacggc 


tgtattgttt 


ttcacacatt 


84780 


gagaaatgat 


taatttttta 


atatcatgaa 


84840 


tatgaaaatt 


gaaggcaggc 


agactatcaa 


84900 


ttcctagctg 


tgtgactatg 


gtcaaggcac 


84960 


cgagaaatgg 


tcataacaat 


aatatattgc 


85020 


tgaggcatca 


acatccaagg 


atgctgtagt 


85080 


tgttattctt 


caagaaagca 


aggaatgtca 


85140 


tgcacagcct 


ccattcatgc 


agctgaatgt 


85200 
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tccaccatcc atatgtcact ggtagaacat 
gcaaatagca gtcagaaaaa aataaatttg 
5 attgtagaca agttattcca ctttcagagc 

tcggatcaaa atgagtgaga agacaggggt 
tgacagttgt ttttcctatg gaagagtagc 

10 

tttaaaataa ggaaaaagct gggaattttc 
gttgggatcc aaggaaaaag aagtgaaacc 
15 ggaagtggtt ctcaaacttc agcagcatga 

atggctgatc cttgcccctg gactttctag 
tttccgacaa gttcccggtg atactgatac 

20 

tctgctatta atactaaatc taatagtttg 
ggcccctctc ttttttaaga ttggggtaac 
25 tattttattt tattttattt tattttattt 

acgtgcaggc ttgttacata ggtatacatg 
cgtcatctag gttttaagcc ccacatgcat 

30 

ttgcccccca accctccgac aggccccagt 
ttctcattgt ttaactccca cttatgtttg 
35 atgatggttt ccagcttcat ccacgtccct 

gctgcatagt attccatggt gtatatgtgc 
gggcatttgg gttggttcca agactttgct 

40 

ctacctttca acaaagaaca aatgttaact 
gttttctcaa tcttcaaaac accctttgga 
45 ataagaaact gacagcgtat gcaacttgct 

tgagaagtgg acttagggct gtctgagctc 
tactttatag acgattaata ttaaataatg 

50 

tgtccattga caatttggag tgtttctctc 
atttatactt gtgatttagt gaggacagaa 
55 gcacacgtta ttttgtgcca taggatttgg 

gtaaagggat aagcccacca gccacaacaa 
ctctcatctg aatgggatat cttcgcatct 

60 

tgcctacctc ctcccattgc acacttgctc 
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ttcttctggg 


aaatgaaaac 


atagtgaaat 


85260 


agtctcagat 


ctattcctgt 


gggttgcata 


85320 


cttggtctcc 


ctactttgca 


aagaggcggg 


85380 


gaatgcctta 

ZJ 


gccatcaggt 


gtcagaccag 


85440 


atggaaaggg 


gcctcagaga 


ccagctagat 


85500 


tgaagcttta 


tataaccaat 


attttgaact 


85560 


acaaaggaga 


agtgtagggc 


atttctgcat 


85620 


gattcacctg 


gagggtgtgt 


tgaaacacag 


85680 


taactctqqq 


qaqqqqa.cc a 


aqaatttgac 


85740 


tqctagtcca 


gagaccgtac 
— > — 1 — ' 


tttgagatcc 


85800 


qggtcaatta 
— * — ' — <* 


ggaaaaatat 


tcacaacaat 


85860 


acttttattt 


tattttattt 


tattttattt 


85920 


ttactctaag 


ttctgggata 


catgtgcaga 


85980 


tgctatggtg 


gtttgctgca 


tctatcaacc 


86040 


taggtatttg 


tcctaatgct 


ctccctcccc 


86100 


gtgtaatgtt 


cccctccctg 


tgtccatgtg 


86160 


gttttctgtt 


cctgtgttag 


tttgctgaga 


86220 


gcaaaggaca 


tgaactcatt 


cttttttatg 


86280 


cacgttttct 


ttatccagtc 


tattgttgag 


86340 


aaacaatggc 


ccttctctta 


tgggtggtct 


86400 


gttccctctg 


tgccaggttc 


tatgctggca 


86460 


gaaggtgcca 


ctattatgca 


ccttttatag 


86520 


caaggtcgta 


caactaataa 


atgaaggaac 


86580 


ttaaccctga 


tactgtcttg 


ctttcgagtt 


86640 


ccaattcatc 


agctagcacc 


aaactcaaaa 


86700 


actcaggatg 


cagctgggac 


agggggatca 


86760 


aqcctcctqa 


gaacatacct 


ctggggtcct 


86820 


tgtcttgcca 


gtaaggactg 


tataatctaa 


86880 


ttaacaagta 


aagtacacta 


gtaa.cattat 


86940 


aataatatct 


ttaatgttct 


gagggctaca 


87000 


cccagcatca 


aaaaattaag 


caattctcta 


87060 
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agtccagctt 


atcagtcctc 


atcaaatcgg 


5 


tcccttctca 


actttttcta 


atgctcaaga 




tgatcactgt 


cataatctct 


aggaccctgg 




«^ 4— 

aacaaaatcu 


/— i 4— ^ ^ +— /~r 4— 

CLaCaLgy LC 


yCaatCCCad 


10 


tgaaagggat 


gatcattccc 


aggccacttg 




cccaagaagc 


ctgggaaaga 


cagtctgttt 


15 


atcatacaca 


aaatgcctta 


attaaattaa 




tcaaggacag 


aagtacagct 


ggggcaaatg 




pa a a /~i aa /**< o a o 
CaaayaCCad 




yL-yL-yucn-yL. 


20 


gtgtgtatct 


gtgtgtgtgt 


ctgtgtgtct 




cctccttccc 


tctctctgtg 


tgtgtgtatg 


25 


tccctccttc 


cctctctctg 


tgtgtgtgag 




cctccctcct 


tccctctttc 


tctgcgtgtg 


• 






LLLLLLLLLL 


30 


ccttctctcc 


tcttctttta 


tctctccctc 




ctctactaac 


tggtgttctc 


tgctccccct 


35 


ctgcctagcc 


tcccatccca 


cagtagtcct 




aagttaccag 


agcatttggg 


gtcagttgtt 




aa fi(~mf*ff~x t~* aa aa r-r 

aggg gg caag 


atyy yy ciyy l. 


ay LL I— y LyLL 


40 


aatgccaact 


gcagggggct 


aagcaaagcc 




agaagctaag 


tagcctcggg 


aaaccctagt 


45 


agtgtgttct 


tgccaagaaa 


ggcaagaagt 




cttattttaa 


tgaagatgaa 


actaaagata 






yciclci Ly LdLt 


LLLLLdLy Ly 


50 


aggtgattgg 


agccaacagt 


gtacctggca 




agcccagcaa 


ccctgtggcc 


tctggaaatc 


55 


caatctcacc 


aaaagtaatc 


cactgtgata 




tggtgaacca 


gaactcagca 


gcaggtgtac 




atccagtggg 


aacagccagt 


ttgcagacgg 


60 


ccacgaatct 


ccagacatgg 


cctggagacc 
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gacaatatcc 


caggtctccc 


cccaagtgat 


87120 


ttcagtagaa 


actgcaggct 


attaagggtt 


87180 


accataatgg 


tttaaacaca 


aggaaaatgc 


87240 


caccaacatg 


gacaagaccc 


atgaagagat 


87300 


gcccatggtt 


cctcctggat 


gtcaagggca 


87360 


ttttgctctg 


aaaccaatta 


atgttgaaaa 


87420 


acccataaat 


gagttaacta 


gttatagact 


87480 


agaacctaga 


accgagagtc 


aaaaggacat 


87540 


gtgtatgtat 


gtgtgtgtgt 


gtgtgtgtgt 


87600 


gtgtctatgt 


ctgtgttatc 


tgtcttcctc 


87660 


tgtgtgtgtg 


tgtctgtgta 


tctgtcttcc 


87720 


tgtgtgtctg 


tctgtctgtg 


tctgtgtctt 


87780 


tgtctgtctg 


tctgtgtcta 


tgtgtcttct 


87840 


ccttccccct 


tttcctttcc 


ctctctctct 


87900 


ccccaatgct 


tctcagcttt 


attctcttca 


87960 


gcacacaaca 


tacatgtgac 


ccctccaact 


88020 


tgggtccagt 


tccaaattcc 


caggagaaag 


88080 


cacccccaga 


tcaatcagct 


ttgccagggc 


88140 


cactcaacaa 


actcaaggca 


tgcagattaa 


88200 


accatccaga 


ctgtgaacaa 


aaggtctaca 


88260 


cttgcaaatc 


caaatccgta 


aaaggaaaac 


88320 


ttattttgtc 


tggtaccatg 


ctaggtgatt 


88380 


agaatgagat 


aagatgcatt 


attcattttg 


88440 


ttttgcagca 


gttgtacacc 


atgaccgaac 


88500 


ttattgcccc 


agataatgtt 


cacgtcatcc 


88560 


atctgcagcc 


ttcagaggtg 


accacttacc 


88620 


caggaagagc 


aaacttacag 


aatctactcg 


88680 


agagtcaacc 


cattgggtat 


cagcgacagt 


88740 


tgcctggagt 


gatccaatac 


acacagggaa 


88800 


ttcagaatcc 


tctgaatgct 


aaccctgggc 


88860 
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tgacacacac ctcaaactca tcccagtgga 
atcccaagaa attcataaat gaggaggtca 
5 ctctccgatt tgggtcactt cataagccct 

gtgttgagag ggtaaagggg aggtggagtg 
ccagtacaat cacccagatg ctggcccctc 

10 

acccatggtt tgggggataa aggtttgaat 
atctatcagg ggcatgggtc agcaatagtt 
15 ggcaatccag gtctcaggcc agctagattt 

gctctgggcc tccatttcct catctttaga 
ccagctccaa ggttttgtga tttccttttc 

20 

tcctccagtt tcatttagca ccttccaaag 
ccagatgcct ggctggatct ggatgcctgg 
25 cagatgccca tactctatgc actcatccat 

gagactcaga gataaccaaa acatcatcca 
agctgtcata attacaatac cttctggcaa 

30 

ctgcccaaag tgcaaattgt ctggcttctt 
ggaagacaag gacaggtgca ggaaggggac 
35 aagtagattg tgccaggaac aaggcgttgt 

tgaactaaat tctataaaac agcaatcagg 
ggctcatgcc tgtaatccca gcactttggg 

40 

aaatcgagac catcctggct aacatggtga 
tagccaggtg tggtggcggg tgcctgtagt 
45 atggcgtgaa cccgggaggc agagcttgca 

cctgggctac agagctagac tcagtcttta 
tcacaagcat caagtgagat tataaatgtg 

50 

ctatgatatt gcgtagtaat tatcatcttg 
atactgggta aaggtttgtt aactttggct 
55 ttcaaaattt gtcaaaacag aattaagtca 

aaatgaaaag ccaaaaagcc aaaactgcag 
gggaaggaaa ctatgagtgc ctcttagccc 

60 

gcagcacttg cagctgcagt atttagcccg 
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acacgtcatt 


tgcatcattt 


acttcattta 


88920 


gaacattagg 


ggtgagtgtg 


atttctcctt 


88980 


cgtctagtgc 


tctgggatga 


ggaagggaat 


89040 


ggtgtggggg 


gcagtttctg 


gaggctggac 


89100 


tcccttgtct 


tccttccact 


ccccaaggag 


89160 


aatcagtgaa 


catccccacc 


ttacccagct 


89220 


ccactaaaaa 


gaccactgag 


cagggagtca 


89280 


gctgagtggc 


cttgcagaca 


tctcttcctc 


89340 


ataagaagat 


tcagctttct 


taaattcctt 


89400 


aatcaagggg 


gcctctgcct 


ctttcctgcc 


89460 


ctcaattcaa 


ttctgttttt 


gaggcagaag 


89520 


ccaactctta 


gtatggagac 


tggcgtgtca 


89580 


tcaacactgc 


acgtgccctg 


caaggagctg 


89640 


tgaagatgaa 


ggagggtctt 


tgttcatggc 


89700 


ttgccatgag 


acatgcataa 


gttattgtgg 


89760 


tgagaagtta 


caaactgcct 


cctggaggca 


89820 


ccattcctcc 


ttatttaatc 


ctgggtagac 


89880 


ttgcctccca 


aacaaatcaa 


aatcaaatcc 


89940 


aatagaaatc 


ctggtcaggc 


cggacacggt 


90000 


aggccaaggc 


aggcagatta 


caaggtcagg 


90060 


aaccctgtct 


ctaccaaaaa 


tacaaaaaat 


90120 


cccagctact 


ctggaggctg 


aggcaggaga 


90180 


gtgagccgag 


acagcgccac 


tgcactccag 


90240 


aaaaaaaaga 


aaaaaagagg 


gaaatcctgg 


90300 


tgcggaaatc 


ctccaaaagg 


taaagctggt 


90360 


tattacaatg 


actatcatta 


ttattaatta 


90420 


agcaattatt 


ctaagatatt 


ttggaaggat 


90480 


t~i s it t~* a a 3 f" 3 t - 
yciyi— .ctclclL-ctL, 






j \j j ^ \j 


actatcttta 


atcaattaca 


attttgtttg 


90600 


ttttcccgat 


tgccccaaga 


atattcgcca 


90660 


agataacttt 


gccaatgtct 


cccttttatt 


90720 
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ataactttcg ccttgctcta gcatattgac 
agcattctgt ttttagtagt agtattccca 

5 

aaaatgtcaa atcctagaaa acgtagcatt 
cagttgttgg ccaaagattc atttgatgaa 
10 gttagttctg tttagaaatc actcgaaaaa 

atcagccaga tttgcttcag cctcaagagt 
agcgagcggc actatttttt ttctaaatgg 

15 

gggcctgtag acagagtcaa gcatgtcgcc 
tctgtcacca actcatagac tcatggacca 
20 tcatttgatc cctgaggaaa ctgaggccca 

ccaaatgaga ggggattcat cctggacggg 
agggaaggat aggatttcag gaggaggaga 

25 

cagtggagga aaagcaggga gtaaggggaa 
tccccttgtg tacgttgcca aggggaactg 
30 aatatcggta ctgggaagat gaatggagtc 

caggcctggc gtggtggctc acccctgtaa 
gttcacgagg tcaggagatc gagaccatcc 

35 

aaaatacaaa aaaaaattag cctggctggg 
gaggctgagg caggagaatc acgtgaacct 
40 acgccattgc actccagcct gggcgacagg 

aaaaaaagaa acacctactg catgctagga 
atctctacca cagccctgca agctggatac 

45 

ctcaaacatg ctcttgaggc tgaagcaaat 
taaagacctt tctttgagtt atttctaaac 
50 gaatcatcta ttttggaaaa atcagattca 

gaacgatgtt aacatcacgt gtaggaatgc 
tccctgcaaa taaggaaacc ctgaggcaga 

55 

ccagtgctgc agccaggaca ggagccaggt 
tgtctctgca cacttgtggg ctcagtctaa 
60 tgtcttacct ttcctttatg cctgtgattc 
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cttggaagca 


agagacatca 


ttctatttat 


90780 


tttacaaaat 


acagtaattc 


tcgattacta 


90840 


tctacgcatg 


gtgttaacat 


tgttctcaaa 


90900 


tccagtgttt 


ccgaagtaaa 


caattctaat 


90960 


ccgtttttat 


catttatttt 


cacaatgaaa 


91020 


gtgtttatgt 


aaaattaaat 


gagtgctggc 


91080 


gaacagggtt 


cattaatata 


ctgataaagt 


91140 


ttaggaaggg 


atgttcatgt 


gaaacccagg 


91200 


ggaagacaga 


aagcagctgc 


cctaggctag 


91260 


gagaagccag 


tggtaaatgc 


tgtaggagcc 


91320 


ggctcaggaa 


aagaatctct 


ctggtcctta 


91380 


gtaaagcaag 


cagatttcag 


acagtggcaa 


91440 


aaacacagga 


cttgcagggg 


acagaggatc 


91500 


tggggacgtc 


atagaggagg 


ccacaaagag 


91560 


agcagagggc 


attgaaatac 


ttactgcatt 


91620 


tcccagcact 


ttaagaggcc 


gaggcgggcg 


91680 


tggctaacat 


ggtaaacccc 


atctctacta 


91740 


tggcaggcac 


atgtattccc 


agctactcgg 


91800 


gggaggcaca 


gcttgcagtg 


agccaagatc 


91860 


gcaagactcc 


gtctcaaaaa 


aaaaaaaaaa 


91920 


actgggctac 


aagtttcata 


ggcactttta 


91980 


tgttaccccc 


atcttgcaaa 


tatgtagcct 


92040 


ctggctgatt 


ttcagagtga 


aaataaaata 


92100 


agaactaaca 


tcgaatggtc 


tgaatcatca 


92160 


tcacatgaat 


ttcggccgac 


aactatctga 


92220 


tacgttttct 


aggatttgac 


atttttggtt 


92280 


gaggatattt 


gctggagacc 


acacagccgg 


92340 


ggttttcagc 


cacaacccgg 


gctcggcccg 


92400 


agggaacaca 


tttcttgctc 


tgttataggc 


92460 


tcagcctgag 


tgcaaattcg 


catcacctgt 


92520 
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gaagttttta aaaaaacacc caggtgtgga 
tctgggatga gcccccaaca ccaatcaccc 
5 caaagcacag ctccacagcc cacaggcccg 

ccctgcaggc catccagatc ctcatcggcc 
tgctgtatta ctatcctttt gtgacctggt 

10 

ccgtgagtac aaggccatat ggtctccttc 
tggaggaaga aggagcatgg aatcccattc 
15 gcctccaact tgcagataca gcaaagggtt 

tcccaccacc accctcccca tctcccatac 
tacaaacact cattatcatg ttctttacct 

20 

gatatttatc tgataaggcc aggcagggtg 
ggccgaggta ggtgaatcac ttggggtcag 
25 agacctcccc ccacccgcca tctctacaaa 

tgcacttgta gtcccagcta cttgggatgc 
gtggaggctt cagtgagcca tgatcacacc 

30 

agcctgtctc aaaggaaaaa aaaagaaata 
gtgtgctata gacagcatct acctaccata 
35 aggaattatg taaaatgatt gtcaaaccat 

tacatacgac atggaaatca acacgtgctc 
acctcataag gcagatgtta atcatttacc 

40 

gagcttaaat aatattaata aattgaacac 
cctagggtgg tcaaattcat aaagatagaa 
45 gtaagagggt aatggggagt tagtgtttta 

aaggagatct tggggtggct ggttgcacaa 
tttttttttt ttttgagggg gagtctcgct 

50 

atctcagctc actgtaacct ctgcctcctg 
cgagtagctg ggattacagg catgtgccac 
55 agagacgggg ttttgccatg ttggccagcc 

gccccccttg gcttcctaaa gcgctgggat 
caatatgaat tttcttggtg ctagtaaacc 

60 

ttaagttgtg tgtattttac cacgattaaa 
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ccctaccctc 


agagatttgt 


atccatttgg 


92580 


ctggggtgat 


tctggggcac 


agccagggtg 


92640 


cagttctgct 


ctctcacagt 


gacttgttct 


92700 


tgacgcacat 


tttctctgca 


attaaccctg 


92760 


tgtcagggta 


cccgctctgg 


ggaggattat 


92820 


ctgggtcacc 


agactgggac 


aagaaggaag 


92880 


ccagtaatat 


gactctgtag 


actagagcca 


92940 


atatcaactc 


cctgccccag 


caacactccc 


93000 


cccccaccca 


gcctctaggg 


aactattggg 


93060 


cttccaccct 


ctctgggttc 


tcttttatga 


93120 


gctcacgcag 


gtaatcccag 


caatttggga 


93180 


gagtttgaga 


ccagcctggg 


caacatggca 


93240 


aatacaaaaa 


attagctggg 


tgtggtggtg 


93300 


tgaggtggaa 


ggatcacctg 


agcccaggag 


93360 


actgcactcc 


agcctgggta 


acagagtgag 


93420 


tctcatatgt 


aactcagagt 


acgtaaagtt 


93480 


ccagttatgg 


gagccgactg 


caaaatttcc 


93540 


ttggtagcag 


gaagttggcc 


atgttgggta 


93600 


caaatcaggg 


cccttccctg 


cagtgctgcc 


93660 


agcaggccat 


tcagtatatt 


tcatcttgca 


93720 


ccacatgtac 


aattccactt 


agatgaggtc 


93780 


aaagcagcat 


ggtgattgcc 


aggggctggg 


93840 


tggggactga 


gtttcagttt 


gggatgataa 


93900 


caatatgaat 


tttcttttct 


tttcttttct 


93960 


ctgttgccca 


ggctggagtg 


cagtagtgcc 


94020 


ggttcaagaa 


attctcctgt 


gtcggcctcc 


94080 


catgcccggc 


taattttttg 


tatttttagt 


94140 


taafcc tccraa 


ctcctaacct 


caaataatct 


94200 


tacaggcatg 


agccatccca 


cccagcctaa 


94260 


aaacacttga 


aggggtttga 


atggtatgtt 


94320 


aatagtaaat 


taaattaaag 


ttttaaaaaa 


94380 
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tgaacaccta cgtgtccacc acctaactta aaaaacaaag catcacacgt ggtttttaaa 94440 

tttttaaatc cctataagta tcccctctat gccccgagag gtaaccacta ttttcaattt 94500 

aatgtttttc tttttttttt ttttttgaga cggagtctcg ctctgtcgcc caggctggag 94560 

tgcagtggcg ccatctcggc tcactgcaag ctccgcctcc cgggttcacg ccattctcct 94620 

gcctcagcct ccagagtagc tgggactaca ggcgcctgcc accgtgcccg gctaattttt 94680 

tttttttttt tttttttgga tttttagtag agacagggtt tcaccggggg ctcgatctcc 94740 

tgaccttgtg atccacccgc ctcggcctcc caaagtgctg ggattacaag cgtgagccac 94800 

ggggggtggc cgatttaatg gttttcattc ccttgctttt tttttttttt ttttttttga 94860 

aaaggggtca ctttgtcacc caagctgaaa cacaggggca caattttggg tcactttaac 94920 

ctctgcctcc taaggtcaag cgaatggccc gccccaaccc cctgaataac tgggattaca 94980 

ggggggcccc acccctccca actaattttt gttttttcaa taaaaacagg ggttcaccat 95040 

gttaaccagg gtgggctaaa actcctgggc tcataatcca cccacctcgg cctcccaaag 95100 

tgctgggat 95109 

<210> 81 

<211> 19303 

<212> DNA 

<213> homo sapiens 



<220> 

<221> genomic_JDNA 

<222> (1) . . (19303) 

<223> n is an undetermined nucleotide (dATP, dCTP, dGTP, or dTTP) 
<220> 

<221> coding_region 

<222> (8830) . . (9012) 



<220> 

<2 21> coding_region 
<222> (9468) . . (9587) 
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<220> 

<221> coding_region 

<222> (10759) . . (10815) 

<220> 

<221> coding_region 

<222> (12466) . . (12597) 

<220> 

<221> coding_region 

<222> (14049) . . (14159) 

<220> 

<221> coding_region 

<222> (16890) . . (17009) 



<400> 81 

atttttctaa agtacaaagt catcatgtta ctgttctgcc taaacttatt caggctctcc 60 

tttgcccttg ggctaaagtc caaatgctct agcaacactt acaaagccct ggagatgtag 120 

cagttcacct ctccagcccc atccctcctc accctccact tacacttgac cccgagccat 180 

cctgagtctc cgtgagctcc atctgcctgg aaggctgctc tctgcctgct tccaaccctc 240 

ctatttgtcc tgcttragctc ccaaccccac ttcagacctt catttagata ccacttcctc 300 

caggaagcct tatctggcct catcccccaa ccccttctag attaggggaa ctcctccagg 360 

ctctcgccac acccacatct cagtatcaca tcactcaccc gtccacagca cagttacctg 420 

tctccctgtt catcttattc aatggacttc atattctttc cctattgctg ctgtaacaaa 480 

ttaccatgaa cttagtggcc taacataaca cagacttatt attagacagt tctgaggtca 540 

gaggtcctaa aatccaggtg tcggcagggc tgagttcctt ctggaggctc cagggaggcc 600 

ccgcttcctt gccttttcca gcttctagag gccacctgcc ttgcttggct tgggaccttg 660 

gcctccacct gccaagctct tccactcact ctgaccctct tgcctcccgc ttgggaatac 720 

attgggccca ctgaataatc tatccctatc aacaactttg acttaatcat ctgcaaagcc 780 



-329- 



WO 02/062946 

cttttcactg cataaggtaa cgtattcaca 
ccattgctct gactaccaca aattctgagc 
5 ctcctggatc cccagcatgt cacacagcgc 

tatcaatgaa cggatcagtt aatgaacaag 
actcagcaaa aaagcactgg tcctggaaaa 

10 

cgtgggtgat gaacgggtgc aaggcaacca 

agcattacag tgggcaggca agtgccttca 

15 ctgctgctca gcctggcccc ctacccctca 

gaggggtgag tggccttttg cctgattgtg 

# 

acatgcatga cagctgagca cgagcgaccc 

20 

cacggccact gacacgtgtg aggcgctgct 
aaccctccag caacttgatg ggtactatta 
25 ccaccaaagt caaataactt gcccaaggtc 

gaactcaggc atcccagatg cagagtcagc 
aaagtcagtc agtcctggca tcagggacta 

30 

tccctgtcca ccctcagcag agcgatgtgg 
caaggcggga tctaggggtc gctggatctg 
35 tgggggttgc tggatctggg agttgtggga 

gatctggggg ttgctggatc tgggagttgt 
gtgggatctg gaggttgctg gatctggggt 

40 

ggttgctgga tctgggggtc actggatcgg 
tgggagtcac aggatctggg ggtcacggga 
45 gatctggggg tcacaggatc tgagggttgt 

gtggtacctg ggggtcgctg gatctggggt 
gtcactggag cccactccac acctcctcca 

50 

ttcccagagc agcaccgtgc tcatggaaag 
gtcagatgca atttgtgggc catgcctgga 
55 taacaagaca tttgtgagat aatcagaaac 

taagtaccaa aaactagata agaacaggtt 
cacaacaggg gtccccaacc cccaggccat 

60 

accaggcctc acagcagaag gtgaggggca 
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a tat aaacat 


tQttcraQQtcf 


840 




caaaaacaac 
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cQCtacrctct 
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1680 
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caaatcrciaaa 


aqqaaqaocc 


1740 
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r3ZJr3 v -* r *^ % -' v "Z3Z3Z3 
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crcrcrorcr t c a c t 
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teataqrerate 


2040 


t ctQQcrcrcrtt 


qtqqqatctq 


qqQQtcatQQ 

3333 3 3 


2100 


aaaatctaaa 


qqtccctqqa. 


tctqqqqqtt 


2160 


cactaaatct 


aqqqqtcatq 


qqatctqqqq 

33" ^ ^3333 


2220 


ccaaaacaac 

V V** V»*». Vp** >**** *•***• ^pp" 


ctcacQcraaa 


accttcacac 


2280 


a t cacrocrcac 


aacratQtcfQC 


aactqat tct 


2340 


t cc tcfcratcc 


tqatttqaac 


aaatcaaaag 


2400 


tgaagctccg 


actagttatt 


tgataccegg 


2460 


tttctgactc 


ctagggttaa 


gcttgtcccc 


2520 


ggagctatac 


cagtctgtgg 


cctgttagga 


2580 


acaggcaagc 


atgacctacc 


acctgagtgc 


2640 
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cgcctcctgt cagatcagcc gcagcaatag attctcaaag cagaacgggc cctattatga 2700 

attgcacatg caagggatct aagtggtacc ctcatgagaa tcgaatgcct gatgatctga 2760 

ggtggactaa tttcatccca gaaccatccc ccatccccac cccggcccat ggaaaaattg 2 82 0 

tcttccacga aactggtccc cagtgccaga acggttgggg accactgctc tatgaaagta 2880 

gtttcttttt tcctttcctc tttttttttt tttttttttt ttgagacagg gtcacactgt 2940 

gttgcccgga ctgaggtgca gtggtgtgat cacagctcac tggactcaag tgatgctccc 3 00 0 

acctcagcct ctcgagcagc tggaaccata gacagggacc aagacacctg gctcattttt 3 060 

gaatttttga atttttgtag agatgaggtc tcgctatgtt gcccaggctg gtctcaagct 312 0 

cttggctcaa gtgatcctcc cactttggtc tcccaaagag ctgggattgc aggcatgagc 3180 

caccgcccgg ccctttaaaa gactagtttc caaatgttca acagagctct gtggggctgc 3240 

agggttaaac tctgagcgta aatagacctc agtgagcacc aggactcggt tttaattttt 3 3 00 

ttgttctgat catgtggtca cacttttttt ttaaagagtc catatctttt agagataaat 3360 

actaaaatgc ttacaaataa aatgggaaga tatctgagtt ttgcttactt ctgggggaga 342 0 

ataagattgg ccaagagctg ctgatggtgg ctgtagctcg cgaatgggca caggggtgtc 3480 

atggaaccat tctctctact tttgtgagtg cttgaaattt tccagaatta aaagaactct 3540 

ttcaaaaggt tcaacttctt actccactcc caggatctaa gacgattaca accactttca 3600 

ctccatcagt catgccctga aaacagatta aattatcaac atttcccttg ccaaaggaaa 3 660 

tttacaactg tcaaggagag acataaagaa gggcactaag attttcacct gccggggttt 3 72 0 

99cgggaggg ggaaggactt tgacagcttc catgcaaccg gggcatgaag cagggcgtgg 3 780 

gtccaggagg gagaagatcc aaagtcgaat gttgtaaaga tgtgctgatt tcccagcgct 3 84 0 

gtctctgggg agctccggca gcgcaagagg gcaaagcaca gctggaagct cagagctgca 3 900 

gtcccaggtc ctggggtgag tggtcccccc atggcaggaa cagaggggag tgattttgtg 3 960 

tttccatgtg gagttggggg gttgctcagg tgtacagacc tggtcccccc ttgaagccag 4020 

gggaccggcc tcttataccc ctagccaatc agttgttggc tgtagggttg gagcggtgtt 4 080 

acttaaactc ccagaaggga ggagatgtga gcctttagca atgaacaatt ataccagctg 4140 

gagaatgggt ggcccaccca gcagagggga tctgggaagt agggggcacc gtcggcctct 42 00 

cccctgcgga gccccatcct gctgccagtt actgaccctt attgctggag gagccccaga 4260 

cacttttctt gcagtctgga aatgcataca cacagagaga ccagttcaag aaaacagagc 4320 

cccagggagg agaagggact tgtcgagggc cacgaggggg caggactaga atccagattc 43 80 

ttgactccct ccttggacgg aggcctctgt ttcctgcagg ccttccagct gctggtgggt 4440 
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gagctctctt 


tacaccaggc 


acactgtctg 


gaagcaccag 


gaggaggctg 


gttcagtaca 


4500 




gtcgtaggtg 


gttttctaac 


tggcattcac 


tgaaacgctg 


atcctctaag 


gcaccccacc 


4560 


5 


cccaacacca 


gcctgcctgt 


cacatgttgt 


gacactcttg 


tcactcagca 


cctctcaatt 


4620 




cctgcctttc 


tgtccttgac 


atgcatgcac 


atgcatgcac 


acacatgcac 


acacacacac 


4680 


10 


acgcacacac 


acacacacac 


acacaccctg 


agaagctctc 


tctgtcctct 


agggctccac 


4740 


tgcacagtct 


gatgagggtc 


atattattca 


tgccacgttt 


cctctgcctg 


ctcagccaca 


4800 




gacctcccat 


ggacacatct 


tactcactcc 


tgcctcggtc 


cctgcagtgt 


ccagtgcaga 


4860 


15 


aactccacaa 


acaaatccac 


gaatgacagt 


gttggatctc 


agatcactag 


ctcaatacac 


4920 




tcagtgcctt 


tcttccctca 


cgacccccta 


cccccacccc 


agggccccag 


agctctcacg 


4980 


20 
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cctgtaagca 


ggatggggat 
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gcagatggag 
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25 
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aaggtggagc 


5220 
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5280 


30 


ctatacgcca 


gtccctgtca 


gttgcaggca 


ttgtttcagc 
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5340 




ctagggacag 
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aaggtgaggg 
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5400 
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ggagcagagc 
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35 
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tttgctgggg 


5520 




ggttctggag 
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agctctgtcc 
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gaactgaagg 


5580 


40 


aacaacgaga 


atatgtgtgg 


gacccaggca 


tcccgtggcc 


ctgtgcagag 


cgctgaggcc 


5640 




ctagtggcag 


tgggtttccc 


gtgggatacc 


aagctggctg 


ggtccgtggt 


cctgagctgg 


5700 




ctgagggggc 


tggggttggg 


cggggaaagg 


ctaaaaaagt 


tcctttatgc 


accaaacagc 


5760 


45 


ctgctgcctg 


ccccctctct 


gtcagcacac 


tgtcacctcc 


tagcagccct 


gcacaggtcc 


5820 




tggaagtggg 


ccaggacagc 


tgggaataag 


tgttctaagg 


aatcagagtt 


catgctcttt 


5880 


50 


gtcttggaaa 


ggcgtacgag 


agagtcataa 


caaacttttc 


cccatgatga 


ctaaggtctg 


5940 




ccctctgcaa 
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tcctggttgt 
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ccaggaccag 


aaattgaaac 


cttcaaggcc 


actttgccca 


aaagcagacc 


ccacagaata 


6180 


60 


ccaattgcat 


agaaccgtta 


gacaatgaga 


aaaaaaagat 


cctatggcca 


gccgggcgca 


6240 




gtggctcaca 


tctgtaatcc 


tagcactttc 


ggaggccaag 


gtaggcggat 


cacctgaggt 


6300 
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cgggagtttg agaccagcct gaccaacatg 
attagcaggg cgtggtggca ggtgcctgta 

5 

gaatcacttg aacccgggag gcggaggttg 
agcctgggtg acaagagcga aactccgtct 
10 aataaataat aaagattata tggccaatta 

gagtgtgcca cttgtagaaa atctcagagt 
aaccatggtt tctcaaactt atttgaccat 

15 

caatgttcca tggaacatac tttgagaaag 
gttggatatt tgaaccattc ctactgccac 
20 tacagctcca gtgcccaggc actcactatc 

ctctactaag aagtccctct tcttctgttc 
tattgtcctc ccaggtgtgc agagtctagg 

25 

ctgccaccga atacatccct tagtccagct 
ggaaccttcc caagggacag ttcccagagg 
30 acaaaaagaa tcatacagca ctgtgtctat 

gagaccctct gcccactcac cccactaaat 
atcaatcagg gggtcctacc cagcagccca 

35 

atgcgtggtt ccaattccag gcacttcctc 
tctctgaacc ttggtttcct cttctatgca 
40 gttgccaggc aaaactgcct cctggcacag 

aggccccatc tggctctttg gctcttgata 
aggatatagt cacctgggct atttgtccct 

45 

cctgtggaaa atgtggatgc tcaatggaac 
gagaggaatg gccctccctt ttcattcatc 
50 ctcaaggatt tgctcattag ccccatcccc 

gcaagccagc tccctttctt ccccgcccag 
ttggattttc tcaccaagtc ccaggcagcc 

55 

tatacactat accaggccct gtctgcaggt 
cccaactcct ttttgtgccc atgagaacat 
60 acagtgcatg aaggacttaa gccccaggct 



aagaaacccc 


atctctacta 


aaaatacaaa 


6360 
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ctcaggaggc 


tgaggcagga 


6420 
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6720 
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6780 
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taggtggagg 


6840 


tccgtggaca 


agcagttcca 
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6900 


tttgcagtgg 


agtccaggcc 


tgggtccagg 


6960 


tgaagccagt 


ggctaacctg 


gttcaggcct 


7020 


agctgtgtca 


gacggacttc 


tgaaggcttg 


7080 


gacagttccc 


atgcatagct 


gggcagagtc 


7140 


gagctcagct 


tctgaagccc 


aggaaagagg 


7200 


cagcggtcgt 


cactgcagga 


ccatggagat 


7260 


ggcccagagc 


actggcgatg 


gagtaagacc 


7320 


actgtgtgac 


catggccaag 


gcaggcaacc 


7380 


atggtgctga 


cttcctgagt 


attaagggag 


7440 


tgcccggcat 


acagtgctca 


cccaactgtg 


7500 


atgtcccttc 


ctttcccctc 


tacccaatcc 


7560 


ccaaacttcc 


caaaactctt 


tccctgcatc 


7620 


ttcctggatc 


cacaagtcag 


acttaaaagt 


7680 


caggatacag 


tctgagagca 


gcctctctct 


7740 


atggagaaac 


catctctgac 


ctcaatccct 


7800 


ctgtccactt 


cctgggacag 


aaagactacg 


7860 


cccaagccct 


agtgcatccc 


acggcgtcag 


7920 


cgtgtgtcca 


gcccccttct 


aacttttttc 


7980 


ccagaacatt 


catttccaat 


tgttgagcca 


8040 


ctggagccag 


cctgccatgc 


ttcaagtcca 


8100 
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agtcacttac tggttgtgtg accatgggca 
aaaatgagag tgaggacggt atttacctca 
5 tgcacgtaga aggtttatct cagtgcctgg 

cgttaaatat ggacttcctg actcagtgtt 
tttcttccct tctctccagg aggcatgggc 

10 

cacctgcctt cagtcactct tcttggacca 
attgacctgc acctctcgca cctcccaact 
15 gagctctgtc ttccactggt tgtgaagttt 

agctgtgtct ttgcaacccc agaaaggctg 
cagattattc caggtcaaat caagaccaag 

20 

ttctggaaag atctggtccc ttaaacagag 
cctcagaaat gagacgggtt ctgacggcct 
25 ccagcatcaa tgaaagcaga agccacagtt 

tcatggcaag tcctcagccc agtccagccc 
cagcccaagc tcctggctcc acaccagcac 

30 

aaggagctgg gggtgagcat ccacttccca 
gggatatgag gaggatgctg gggggcccca 
35 gctttcctgc tgaacattca tctgaactgg 

ccacacctac tactgccttg tgaggtttgg 
aggctgccag ctccaggata gatagctccc 

40 

ttctgcagga aatcctgggg agtggtcctt 
gaggacagaa gggactcacc acagccagta 
45 ccagtgaccg atgccctcac cccagccatt 

ccatcgctct gctgcacctg gtctttgggg 
acctggtggt gctgaagtct tggtatccat 

50 

aaaacacaga ccgggtgtgg gagtctgcaa 
caggatgcag ggtgggatag agagagctaa 
55 ccagtgttcc tagctctgct tcttttatct 

agcccctgtc aattctgagt cccaagagag 
gaatatgtgc atttggcaaa gaggagggtg 

60 

ccagccttct gagcagccag gctgcctctg 
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agtggcttaa 


cctttctgag 


cctcatctgc 


8160 


tagggttgga 


atgaggacaa 


aatcaggtaa 


8220 


agcagagaaa 


gtatttgtaa 


atgttagcat 


8280 


ggctcataaa 


aagtcctcgg 


tggaagatat 


8340 


accaccacag 


gaggtaggac 


aattatttcc 


8400 


gtggatggag 


actgtttaca 


aagcccctcc 


8460 


cttccaaact 


gcagtctggc 


cactttgggt 


8520 


gcctggtcct 


catttttctg 


ccccaaaagg 


8580 


tgagcagcaa 


ttactcccat 


ctccaggggc 


8640 


accaggggct 


gccagacagg 


gcaaccacag 


8700 


gtgattggca 


gaggctcacc 


caggcacaag 


8760 


tcctccccgt 


cctgcagcca 


gggcccccat 


8820 


attcccagcc 


gttgtgctag 


ggggctccca 


8880 


tggcagacaa 


gtgcacccca 


gaacacgacc 


8940 


gagaagtccc 


agaagaagag 


cagccttctt 


9000 


gggtcccagc 


agtgggaggc 


agaggagagg 


9060 


agggaagaaa 


gggaaaagag 


aaaggctcca 


9120 


aaaaggcctt 


gaagattgga 


gagaatctgc 


9180 


acagggccct 


gtcctgggga 


gaatggcaga 


9240 


acccatgggg 


cccttggcag 


cttcagctct 


9300 


cttagagagc 


ctggtctctg 


ggatccctaa 


9360 


ggtccagggc 


actagtggcc 


tcaattagag 


9420 


ctctcttccc 


cacccaggcc 


ttccacatca 


9480 


gctacctggc 


ctctatagtc 


aagaaccttc 


9540 


tctggggggc 


tgcctctgtg 


agtagaaggc 


9600 


ggcaggcctg 


ggaggccctg 


gcatttggga 


9660 


taggagtgcg 


gcagaagcga 


ggccctttct 


9720 


taatcacaat 


aaaaat aac t 


ccttccaqqq 

N^fc 


9780 


cacttggagg 


ctatccattc 


tcccataatt 


9840 


agtaagaccc 


agggacagct 


tcattcttcc 


9900 


gatccagcta 


caggtgccct 


gttggggttg 


9960 
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agctacaaaa acacacaggg tgtaaggtgt 
gatgggagat gacactggac cagtgcatgc 

5 

ggtctccagg gagcccaccc cattcctgac 
tccaagctcc acggcactgt cttgggcaag 
10 tgttctcact gcccacagga tggggctttc 

cagtgcttat tcaccctgtg ctcccaggga 
ggaaagaatg attgcagagg tgctcccaga 

15 

aactgcaaga ttcaagaaaa gagaaacagt 
agcgatggtt tgggtttgta aaatgaggga 
20 gcagggcaca gcttggtcca aggtgtagag 

gggaagatgt gaggtggggc agcttcttgg 
ctttgctgga ggaatagtat tccctgcccc 

25 

aataggggtg ggggtgggaa tttgtgggtt 
ttcaaatctg tcctgcagtt tctcatttca 
30 tctaaaactt acctggtgag tgggcacact 

taactttgtt aagggcatct gtctgtaggg 
ctaagagcag gctcttgcct tcaggacagc 

35 

cagacaggct gacagacttg gctgtgggag 
ttccagtgca gtgaggcaga gaaggaaatg 
40 gacacaactt tgcttcctct caaatctact 

caagccccag cctccagctg gatttgcatc 
gaacggtggt cctccctttt ctgcccgtgc 

45 

cctgcacaca tggctcccat ctttgactta 
ttgtcctccc ggagatcttc cctgacctcc 
50 tggacttacc tgtctctact gtacactaaa 

gtgatagccc aaggtggccc aagtaggcac 
gtggaggtgg atggatgatg gaaggaacat 

55 

tcccacctcc cctccttctg cccccgaagc 
tcagcctggc taggctgtgg ttgctcccta 
60 tggttgtgca gtttacccag cagaggcagg 
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ggctcctgtc 


ctcccaaggc 


catcagccag 


10020 


accatttccc 


agaagctcta 


cattccttca 


10080 


tcccaggtgg 


agtcctgcct 


atgcactaag 


10140 


tatcccaatc 


cccctttcac 


atcagtccac 


10200 


actgcacagt 


gatagaactt 


tccacacatg 


10260 


ctagcccata 


gtaggtgctc 


agtgcacatt 


10320 


gaccttggaa 


gcaagaactc 


atatcctccc 


10380 


cggggtggag 


gggatagtgc 


cactgatccc 


10440 


agcaggaggg 


tctaggaggg 


agttctaagt 


10500 


atgggaggag 


gtatggaggt 


ttgtgcacat 


10560 


atgggccaca 


tggcaggcaa 


gaggagctca 


10620 


actggccgcc 


aagggttgat 


tttcttggag 


10680 


ttgaggaacc 


ccagcttctc 


tcctttgact 


10740 


gggatcttgg 


cgataacaat 


gaagaccttt 


10800 


gagatcattc 


tcttccatct 


gagcatggca 


10860 


gtgattgtga 


gggcacttct 


ccaggacttg 


10920 


agaggatgtc 


caagcagaag 


ggtccagatg 


10980 


gaaaggaagc 


accatctgaa 


gcttctctct 


11040 


gctcccaggc 


accctgaccc 


atccatctag 


11100 


gtgtcttaat 


cttaatcacc 


acatagcctg 


11160 


tacttctcct 


ctctaagagc 


caggcagatg 


11220 


tggccactgc 


ccaaacacct 


tgccctctcc 


11280 


ctcctgacca 


ctcaagacaa 


cctgtatgtc 


11340 


atggctgcct 


tccattctcc 


cacaactctg 


11400 


ttatttctgg 


gcagaatgtg 


cccgagcata 


11460 


agagtagaca 


cctagcacaa 


gactggatgg 


11520 


ccctgcaaat 


cacagctcac 


acccaggtct 


11580 


ccaagccagg 


tccttcccta 


tttgtgtccc 


11640 


ctttctcaga 


gggcatacag 


aatagaagga 


11700 


agatgcttga 


caaagacgtg 


cccattcccc 


11760 



-335- 



WO 02/062946 

agacctcacc ctcctgggct gagccccttt 
tgagcataag gacactctta ctgtaaagaa 
5 catatgtgca tagtctttcc tcactaggag 

gatgcttaac ccgagaggac gatgaataca 
agaaaatggg ctggtactgg gacatgtgag 

10 

tttctggtgg ggaaggggct ggatgagggg 
catagctgat gtaggatggt cggggctggc 
15 ttattgcaaa gctcttggat ctgaaggcag 

aggcaagggc cagctggacc ctcctcagtg 
tgtgaggggc tgaggggcca gtgacaggca 

20 

gtctctcctg aggctacaga ggaagctact 
gtctccaccc tttggacagc tgttaccttt 
25 atgacaaacc tcatcagcct cttttgcgtg 

ctctttctgg agagcccatt tgagtccccg 
agtacccagg cctggagggc cctctgactt 

30 

caggaggttt ccagctcaca gaaagcccct 
tttgccaacc tgtttctgtc ccctacaagg 
35 actggtcagc atcactgaat ccaagtccac 

ggggaggagc ctgcctgagg tcaagcagca 
aaactgctga tttcccagcc agggctctaa 

40 

gggagcctcc catcagcctt gcaaagctgg 
ggagtgaaca cggaggtcca gagaggctag 
45 ctcttttcac catatcactg ctgcctctga 

gccaggctcc aagaattctt ttttcttttt 
ctcttgtcac ccaggctgga gtgcagtggc 

50 

ccagattcaa gcaattctcc tgcctcagcc 
cacagccacc accacgccag gctaattttt 
55 gtcggccagg caggtctcga actcctgacc 

agtgctggga ttacaggcgt gagccaccat 
ttattcctta tctcctccca gcacatacac 

60 

tgtgcgagca tgcatggaca cacatgtgcg 
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tagaaccaca 


ggcctcttgc 


tttgtatacc 


11820 


acccattcca 


gtataaaagg 


ctacagattc 


11880 


gatttgcact 


tggcaagagg 


ctcttggaat 


11940 


ggcatttcag 


gaaccatggc 


aggtgggcgg 


12000 


gatagggcac 


tttcagagct 


cctcctgacc 


12060 


tcacctgggt 


gtcgaatccc 


tgacacttgg 


12120 


tttgagcagg 


aaatggcaat 


agggcatcct 


12180 


ctcttgggtt 


ctcccaattt 


gtcccataga 


12240 


gtcacacagg 


agcagctcag 


ctcaatcctg 


123 0 0 


gagggggcct 


tgggcccagt 


gacaggcaga 


12360 


ggcaagcatg 


aggggaggac 


ctagccccag 


12420 


atttttcacc 


tatagaagat 


gttgtgcctg 


12480 


ctgtctggcc 


tcttcgtcat 


ctccaaggat 


12540 


atctggagaa 


tgtaccccaa 


ctccacggtg 


12600 


cagcgaaggt 


gctgccttgg 


atttggccaa 


12660 


cctggagccc 


aggtgtgggc 


tgctgggccc 


12720 


tgtggtggaa 


aggtgggtgc 


tgggctgctg 


12780 


ttttatagat 


tgggaaatgg 


ggccagagac 


12840 


acttgacggc 


aaatccagga 


ccacagccta 


12900 


gcatggtgtg 


cctggtggcc 


cctgtgatcc 


12960 


ccaagtccac 


tttgttaccc 


tcatcttctg 


13020 


attagtgggt 


ctttgactcc 


aagcctggtg 


13080 


agcagaaggc 


agaccactgg 


gtgcctctgg 


13140 


ttttggaagg 


ggcgggggaa 


cggagtctca 


13200 


acaatcgagg 


ctcactgtaa 


cctccgcctc 


13260 


tctcgagtag 


ctaggattac 


aggcacccac 


13320 


gtatttttag 


tagagatggg 


gtttcaccat 


13380 


tctacrtqatc 


ctcccgcctc 


agcctcccaa 


13440 


gcccacaagg 


gttctgacca 


tagtggtttg 


13500 


acacgtgcat 


acacatgcac 


atcacatgca 


13560 


tgcacacaca 


cacatataca 


tatgcttccc 


13620 
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catcaaatgc atgtgatact atgagatgac 
ctgtaaaaca tgctccctca ctctgggact 

5 

tcagatgacc ctaaggctac cttcattctt 
aaacgttagg ccaatactca ccaaagaccc 
10 gaccaagaca gaccaagtcc ccgtcccaca 

tgggagctga aaccccacct gccctctgct 
aagcccacgt gatgcgtgca gacacccagc 

15 

cctctcaggt ccacatccag aggctggagc 
tcttcctgcc agtgcccaca gctgtcacag 
20 taagacaagg gcttgtcttc ccaggaagac 

tcagaaagga aagatatgtg ggggttacaa 
gctgagcgtt cactaagtga cagcccacga 

25 

gagaggaggc caatccaagg agaaaagggt 
cccagaacaa gaggctactg tgctgccagc 
30 aggcccatga tgaccgggca ggattcggac 

gcagagcaag aagagcacga gcactggctt 
tatggtgtgt gtggagtttg gctgtagtgg 

35 

catttaagga attgcaggga ctctattaag 
tggatggatg atgtatggat ggatatagac 
40 ttaactttat cttctcaata accctttttc 

« 

ggctggagta cagtggcacg atctcggccc 
attctcctac ctcagcctcc caagtagctg 

45 

taatttttgt atttttttaa gtatagatga 
aactcctgac ctcaagtgat ccaccaccca 
50 tcagccatca tgcccagccc tcaacaaccc 

ctagcaagct gaggtccaga gaggtcacat 
aatccaggac tcagcctgag ctatgtccaa 

55 

gatggcatag ggctagatat attttaaaat 
cctgaacagt tcaggataag gatgttttat 
60 ccctgaccag aagtgatctc taaggctcat 
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tcccagctgc 


acggctcttc 


ccctcctctg 


13680 


aggtaccccc 


atctgtcaaa 


taggcttgga 


13740 


accagggacc 


catgagattc 


tccagccctc 


13800 


atatgtgcca 


ggcattgagt 


gtagggcagt 


13860 


gagacccacc 


ctgccttccc 


cacgcacacc 


13920 


gtggatctta 


gcaaggaccc 


ctgcctggct 


13980 


gaggcctcac 


cctgccttcc 


ttccctctac 


14040 


tggccttgct 


ctgcttcact 


gtcctagagc 


14100 


cctggagagg 


ggactgccca 


tctgcaaagg 


14160 


aaaaaatggg 


ggcatgaggc 


agcagagagc 


14220 


gaggagcgtg 


tttcattcat 


tcctcacgtt 


14280 


tccagtaggg 


gagccaaggc 


acgggggcaa 


14340 


gaggctgcca 


gagaggaaac 


tccatgggtg 


14400 


tctgaaggcc 


ctggggaaat 


tgcatcacag 


14460 


ccacaggcat 


aggtggaaga 


gagactttca 


14520 


ggagacagga 


cagagaaggc 


ctggggaagc 


14580 


aaacaagtga 


cagagtcatc 


aaagattggg 


14640 


catcaatcga 


tgcatccatt 


gatggatgga 


14700 


gtgcagatat 


acagataaat 


atagctgtat 


14760 


ttttgagact 


gagtctcact 


ctgtcaccca 


14820 


actgcaacct 


ccgcctcccg 


ggttcaagca 


14880 


ggactacagg 


catgtgccac 


catacctggc 


14940 


ggttttgcca 


tcctggccag 


gctggtctca 


15000 


tggcctccca 


aagtgctggg 


attacaggca 


15060 


tttgaggtac 


ttacattctc 


atactacaga 


15120 


cacttctcaa 


ggcgaagcag 


ccaattagga 


15180 


cccacagtct 


tgcccttgtc 


tacccatggg 


15240 


agaaggcaag 


tctctgctca 


ccccaaagat 


15300 


aaaagaaaaa 


aaataaataa 


aaaataaaga 


15360 


ttctttaaag 


aaaaaaaaat 


catattacaa 


15420 
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tctcaacttt 


agaacattat 


ttaaaaaagt 




cactgatttt 


catgttagtg 


acctcatcac 


5 


gttgatggcc 


tctgagtttc 


atttaacatc 




gctgcctgac 


aggaagtctt 


ctcattaaga 


10 


actgaggcca 


accccacttt 


taaggatggg 




tatgagcaat 


aattgttcct 


atttattgac 




ctgcaggtgg 


ctgtattcat 


tttctattga 


15 


cttaaaatga 


cacctgtatt 


accctacagc 




aagggctcaa 


atcaaggtgt 


gtcgccagga 


20 


tctctttcct 


tgccttttcc 


agcttctaga 




ttcttccacc 


ttcaaagcca 


gcatcacaac 




cctttaagga 


cccttatcat 


catactgggc 


25 


tctcaaagct 


gattggaaac 


cttaattcca 




accagagatt 


agaacatggg 


tgtcttctgg 


30 


agatgtcctt 


gaatctcacg 


gccattctgg 




ggaaaaaatg 


aaccttagaa 


gagttacgtg 




ccatgccagg 


ttccccaatg 


ggccctgtgc 


35 


caggctctcc 


agcactgagt 


tggtgccacc 




agcaatcatt 


ggcagacact 


gttggcatta 


40 


agggacttgg 


gtcatggccc 


agcccccctc 




ctggtcacca 


ggacactggt 


agcgggacgg 




tgggcctccc 


cctactccct 


tttcaagaac 


45 


tatggaagtc 


gcttcgtctc 


tcaagcctca 




ttcagaagtg 


aactagcaga 


gaagtgtttc 


50 


cagcaggggc 


ggcgactttt 


ctttcgcaga 




cattgcatct 


caaaggcctg 


ccggtggagc 




gcgacgcaca 


acacaagcaa 


catcagaggt 


55 


V»» C* cm\a« w Ct 


cacfc fccraccc 

*W \m* I— ■ W ^ ^ ^ 


cttctcccta 




tgtaaggggc 


aaggggctaa 


cagtgctgca 


60 


caaggcagca 


atggtcctat 


cccctcaggc 




catctcagca 


cctgacttgg 


ttgaccagaa 
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gccaaatccc 


ccataattct 


tcaagtttga 


15480 


acaccaagtg 


ttcacataat 


gcatatataa 


15540 


aagcagagaa 


ggtgtctttg 


ttaattccta 


15600 


catcagaaag 


ccagatgggg 


gtcttgagtc 


15660 


gaaaactggg 


gccccatacc 


tagctgcagc 


15720 


ctttgctgtg 


ttccaggcac 


tgtgctagtg 


15780 


tgttgtagta 


aattatcaca 


aatttggtgg 


15840 


tctggaggtc 


agcggtttga 


aatgagtctc 


15900 


ctgcatccct 


tccgggggca 


tcaaaaagaa 


15960 


ggctgcctgc 


attccttggc 


ttgtggctcc 


16020 


atcctccaat 


aaccctgtct 


ccctcttcta 


16080 


ccagagagat 


aatccaggag 


aatctcccaa 


16140 


tctgcaacct 


taattcctct 


ctgccgtgtg 


16200 


gggccgtgct 


tctccctact 


gcactagcta 


16260 


gacgcaggtt 


ctgcttttcc 


attttaaaga 


16320 


acttcccatg 


gtctcacagc 


cagtaagtgg 


16380 


cctccccatc 


ggtaaggatg 


gtgccaggga 


16440 


ctcatcacta 


atcagggatg 


gaggtgtggc 


16500 


gacctgcccc 


tagagtgtat 


cctggctaga 


16560 


aggaaaccca 


acagggtgag 


tgacttgcac 


16620 


gaaccagagc 


cccatccagt 


gactcccatg 


16680 


agggagggtg 


gagggagtca 


cacactggtc 


16740 


gtttccccac 


attcatggag 


aacttgagtc 


16800 


ggaagcagcc 


agtcgctcca 


cagctgtgag 


16860 


atgatgatgc 


atgccttgtt 


ccgaatacac 


16920 


ccccgccatc 


ctaccagagt 


gtgattcaag 


16980 


gaagaggttt 


ggccctggct 


ccccagacct 


17040 


crcataQataci 


aaaccacrttc 


ccacfQQcraQc 


17100 


gctagaactg 


cctggcctgg 


gagtggtagc 


17160 


actagaggaa 


ctcccagggc 


cccccaggaa 


17220 


ctacctcagc 


tgaacctggg 


cgcaaccagg 


17280 
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acccatcgcc ttccctgcca gcctccagag 
cccccatgcc ctctggcctc ccttgcctgc 

5 

gatataaggg cagccaggaa ggaaggtctc 
ctcaacc.tac taagacatct ctgagggagc 
10 agggcagctg gggagttgag tcccatagaa 

ccctctgcat gaatcttttt gccctcctgg 
tgcaaggatg acatgatcca ggagtctccc 

15 

aaggttcaca gttgtgacat cgaggttgcc 
tacttgtgct aagtaaataa tgtgagggag 
20 cagtcatctg aggaatagtc ttcaggactt 

gttttttttc cacactgatt cttttttcta 
aagatatagg aaatcagtat ctttccataa 

25 

gtgaaataaa acgcaatttt cccctccaac 
cttgctagct ctgccccagc caccatggct 
30 acccctgcct ctgagccatc acccttgccc 

atttgcagag ctcactcttt gaatctttac 
tatttaatct caatgtcatt taatcctccc 

35 

taatatctcc atttcatgca ggctcagaga 
agtcactgac ggagctgcga tctggaccca 
40 atgcccaaac ttggttggag catagcccct 

gatgagattt gcacatacaa aaggctagag 
cagctcctgg gaacgctgtc ctctcagata 

45 

atctgtggtg gcccagattg ttttgttttg 
ttgagatgga gtctcgctct gttgcccagc 
50 tgcaatctcc gcctccctgg ttcaagtgat 

actataggca cacgccacca cgcccagcta 
caccatattg gtcagcctgg tctcaaactc 

55 

tcccaaagtg ctgggattac aggcgtgagc 
ccgagatagc agtctgctct ctactgcctt 
60 gtatcatttg cccttgcaca gaatatccct 
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caaacccctg 


ggcctgcagc 


cccctggtgc 


17340 


cttctctgat 


caatggtgga 


tgcacagggg 


17400 


cagagcggca 


caggcagggg 


cttcccttgc 


17460 


ctgtgtgtct 


gaatccctgg 


atttgggctg 


17520 


actgtgtaca 


gttcttccct 


ctgaactaga 


17580 


acctcagtgt 


ctccatctgt 


aaagtggagg 


17640 


agcttgaaag 


ctctgagatc 


aggagaattc 


17700 


agtaaaactt 


cagcctcgga 


atttcaaacc 


17760 


agggacctgc 


cttcctcatt 


ttcctaattt 


17820 


ttgcttattt 


gtgagctgca 


atgtattctg 


17880 


tattatttta 


aaggggcact 


ttgtagcagc 


17940 


acagaagcaa 


accccaaaaa 


taacaaacat 


18000 


cccgtggtct 


cacccactgc 


ttcccttcct 


18060 


tccttgaggt 


tccacagaca 


ccccagggac 


18120 


ttcctccacc 


tggattctct 


tcccccaggt 


18180 


tgggccactg 


actctgagcc 


aagtaaatgc 


18240 


atcgatctgt 


gacctagctg 


ctgttgctat 


18300 


agttaagcaa 


gttgccccgg 


acacatggat 


18360 


gactgcaaac 


tgaagagcca 


ctgcctgaca 


18420 


gctctcccaa 


agttgcactt 


tcactgggaa 


18480 


cgatggtcta 


tacagcaaag 


tcagccctca 


18540 


agccatttct 


tacatagttg 


atggctcgat 


18600 


tttcgctttg 


ttttgttttc 


ttttgttttg 


18660 


ccagagtaca 


gtggcactat 


ctcagctcac 


18720 


tctcctatct 


cagcctccca 


agtagctggg 


18780 


atttttatat 


ttttagtaga 


gacaggattg 


18840 


ctgacctcag 


gtgatccacc 


tgcctcagcc 


18900 


cactgtgccc 


agcccaggtt 


ttgaagttgt 


18960 


ataaaatccc 


tgtgtgaagg 


gatgctctca 


19020 


ggggtttgag 


gttctttgaa 


ttctccctct 


19080 
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ttgtcatctc tttcgctgcc acttctggct gtggtcacta gcttggccat agcacctctc 19140 

ttctccactt ctgatctgct gcttctaacc ttctatagat tgcagctggc tttaaaatag 19200 

attgtaaagt gtaaggcatt aggttctgag acagcggcag agagagccat gcaaatgttt 19260 

aggacaaccc agtctttctt tttttttttt tttttttttt ttt 19303 
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